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Bell Laboratories Scientists Named 
1978 Nobel Prize Laureates 


Arno A. Penzias, director of the Radio Research Laboratory, and 
Robert W. Wilson, head of the Radio Physics Research Department, at 
Bell Laboratories, Holmdel, N.J., have been named co-winners of the 
1978 Nobel Prize in Physics, jointly with Professor Pyotr Kapitsa of the 
Academy of Sciences, Moscow. The Bell Labs scientists are receiving 
the award for their discovery of cosmic microwave background radiation, 
and Kapitsa is being cited for his basic work in low-temperature phys- 
ics. 

In 1964, Penzias and Wilson began using the most sensitive radio as- 
tronomy antenna available, assembled at Crawford Hill initially for 
satellite communications studies on Echo and Telstar and subsequently 
for a project they hoped would improve our understanding of the Milky 
Way. They concluded that what first appeared to be a faint noise signal 
was the background radiation (3°K) remaining from a cosmic explosion 
that gave birth to the universe some 20 billion years ago. 

“One of the consequences of [the big bang] theory is that the heat from 
the explosion ... should be left over and should be barely detectable,” 
Penzias said. 

The significance of the theory confirmed by their discovery, Wilson 
said, “is that the universe had a definite origin. During the first few 
minutes, when hydrogen and helium were being formed in the universe, 
there was a critical time when the ratio of certain elements was set. The 
present temperature and density of matter in the universe are an indi- 
cation of what must have gone on billions of years ago.” 

Following in the tradition of Karl Jansky, the founder of radio as- 
tronomy, Wilson, Penzias, and their colleagues have recently introduced 
millimeter-wave technology to the study of radio astronomy spectra, thus 
increasing the useful spectrum. An important early consequence of this 
achievement was the discovery of dozens of chemical compounds in in- 
terstellar space. 

Continuing Bell Labs pioneering efforts in microwave radio commu- 
nications and radio astronomy, Penzias and Wilson and other members 
of the Radio Research Laboratory have been applying astronomical 
techniques to the measurement of earth-space signal propagation. Using 
a new millimeter-wave antenna at Crawford Hill, they are gathering more 
comprehensive data than ever before on the effects of weather on high- 
frequency signals. 


Wilson and Penzias are the sixth and seventh Bell Labs scientists to 
receive the Nobel Prize in Physics. In 1987, Clinton C. Davisson won the 
award for discovery of the wave nature of matter. The Nobel Prize was 
awarded in 1956 to John Bardeen, Walter Brattain, and William 
Shockley for the transistor. Last year, Philip W. Anderson, consulting 
director in Physics Research, was awarded the Nobel Prize for his the- 
oretical studies of magnetism and disordered systems. 
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Robert W. Wilson, left, and Arno A. Penzias, Nobel Prize Laureates 
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Calculation of Steady-State Probabilities for 
Content of Buffer with Correlated Inputs 


By W. A. MASSEY and J. A. MORRISON 
(Manuscript received March 2, 1978) 


In a previous paper, a model for the behavior of a switching node 
that receives data from many terminals over low-speed access lines was 
considered. In this paper, we give the details of an alternate procedure 
for calculating the steady-state probabilities for the buffer content. It 
is shown that a finite system of linear equations may be obtained for 
calculating the steady-state probability that the buffer content is i. In 
a particular case of interest, explicit formulas are derived for the 
number of equations which arise in this procedure, for each value of i. 
Some detailed calculations are given for one example. 


I. INTRODUCTION 


Mathematical models for the behavior of a switching node that re- 
ceives data from a (large) number of terminals over low-speed access lines 
have been considered by Gopinath and Morrison,!? and some particular 
examples have been investigated by Fraser, Gopinath, and Morrison.? 
In this paper, we consider one of the models and give the details of an 
alternate procedure, which was alluded to by Gopinath and Morrison,! 
for calculating certain steady-state probabilities. 

We first describe the model which we will consider. It is assumed that 
the data are received at the switching node in the form of packets of fixed 
size. As the packets arrive, they are placed in a buffer, which is a first- 
in-first-out queue. The buffer processes packets at a uniform rate, pro- 
vided that it is not empty. In an actual computer network, the buffer 
capacity is finite, and a packet is lost if the buffer is full when it attempts 
to enter it. In our mathematical model, it is assumed that the buffer has 
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infinite capacity, so that no overflow is possible, and we are interested 
in calculating the steady-state probability that the buffer content (i.e., 
the number of packets in the buffer) exceeds the proposed capacity of 
the buffer. 

We let the time that it takes for the buffer to process a packet through 
the node be our unit of time. We suppose that £, is the number of packets 
which enter the buffer in the time interval (n,n + 1]. If b,, denotes the 
buffer content at time n, then the buffer content at time n + 1 is given 
by the equation 

bn41 = (bn ~ 1)* + Ens (1) 
where at = max(a,0). The quantity &, is arandom variable, and hence 
so is by. 

Consider the case in which each message from a terminal consists of 
exactly two packets which are separated by k units of time, where k is 
an integer. The packets are spread apart since the speed of the access 
lines is slower than the buffer processing rate. If x, denotes the number 
of first packets entering the buffer in the interval (n,n + 1], then &, = 
Xn + Xn—p, Since X,—, is the number of second packets entering in this 
interval which belong to messages whose first packets entered k intervals 
earlier. It was shown,!? under suitable conditions, that if the number 
of terminals is large, then it is a reasonable approximation to assume that 
the random variables x; are independently and identically distributed 
(i.i.d.). 

A generalization of the above model was considered,! in which the 
number of packets entering the buffer in the interval (n,n + 1] is 


k 
En = a AjXn—-j, (2) 
J=0 

where the nonnegative integer valued random variables x; are i.i.d. and 
the constant coefficients a; are nonnegative integers. It is assumed, 
without loss of generality, that a9 ~ 0 ¥ az. This is the model which we 
consider in this paper. It corresponds to a fixed pattern for each message. 
A more general model was considered? which allows for randomness in 
the message pattern, e.g., a random number of packets in a message. It 
would be of interest to obtain results for the more general model, anal- 
ogous to those derived in this paper for the model corresponding to (2). 
This could be the topic of a future paper. 

The results are stated and proved in a series of propositions, lemmas, 
theorems, and corollaries. In Section II, an explicit expression is first 
given for the steady-state probability that the buffer is empty, under 
the assumption that the mean arrival rate is less than unity. The 
steady-state probability that the buffer content is 1 is expressed in terms 
of the steady-state probabilities corresponding to a certain (k + 1)- 
dimensional Markov process. Criteria for the proper states of this process 
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are obtained, and it is shown that, for fixed k, a finite number of these 
states correspond to a prescribed buffer content. The fundamental 
relation satisfied by the steady-state probabilities corresponding to the 
(k + 1)-dimensional process is derived. 

In Section III, it is shown how this fundamental relation may be it- 
erated, so as to obtain a finite system of linear equations for calculating 
the steady-state probabilities corresponding to a prescribed buffer 
content. Numerous subsidiary quantities are defined to establish the 
required reduction formulas. The use of the reduction formulas to obtain 
the desired steady-state probabilities is described in Section IV. 

In Section V, attention is turned to the particular case &, = X, + Xn-—R; 
so that ao = 1 = ax, and a; = 0 otherwise, in (2). Explicit formulas are 
derived for the number of equations which occur in the calculation of 
the steady-state probabilities corresponding to a prescribed buffer 
content. In Section VI, the steady-state probabilities corresponding to 
an empty buffer are calculated in the case k = 4. 


Il. THE FUNDAMENTAL RELATION 


We assume that the mean arrival rate at the buffer is less than unity, 
and we are interested in determining the quantities 


x; = lim Pr(b, = 1), (3) 


ae 
where b,, satisfies (1) subject to (2). Hence, x; is the steady-state prob- 
ability that the buffer content is i. We will see that the determination 
of these quantities involves the determination of certain other steady- 
state probabilities, as discussed by Gopinath and Morrison.! It was 
proved? that all these steady-state probabilities exist. We proceed to 
state, and prove, the results in a series of propositions, lemmas, theorems, 
and corollaries. We first give an explicit expression! for xo, the steady- 
state probability that the buffer is empty. 


Proposition 1: xo = 1 — pp (x) where wp, = 08-9 a; and E(x) is the ex- 
pectation of any Xp. 


(This result may be derived by solving (77) in Ref. 2 for the marginal 
generating function ¢;(s), and letting s — 1. This was the method of 
proof used in Ref. 1.) 

We remark that, from (2), E(&,) = up (x). Note that our assumption 
that the mean arrival rate is less than unity implies that xo > 0. 

To determine the other «;’s, it will be convenient to use the following 
quantities: 


k 
0 = > aXntr—i-1 forr=1,...,k. (4) 


i=r 


Since we are using the first packets of a message to count the number 
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of intermediate packets at some later time, the 9” correspond in this 
sense to the packet contribution prior to time n to £,4,-1. We will de- 
termine the x;’s by exploiting the recursive relations between D,, &,, and 
9”), 
Let Z/ be the direct sum of a countable number of copies of Z, the set 
of integers. For /, a nonnegative integer, we define the following collection 
of subsets: 
N! = {(no,...,7,0,...)|no,... 7 = O}. 


Clearly, we have N°CN!C...CZ/. We now define a random k + 
1-tuple variable 


Be ib).25.0 0s, 2), (5) 


Using B,, we can define a map U that sends Z! into [0,1]. Given me Z/, 
with m = (mo,m 1, .. .), we define 


U(m) = lim Pr(B, = m). (6) 


n-o 


We can then recover any «; from the U(m)’s via the relation 


w= 2) Um). (7) 
mo=l 
This summation looks unwieldy, but we will show that this is not the 
case. 
We first establish 


Proposition 2: (by—-) — 1)* + okey En-i S$ bn 41-1 forl = 1. 


Proof: Use induction on |. 

(1 = 1) (6n-1 — 1)* + &n-1 = On, from (1). 
(l>1+1) Note that (b,-; — 1) S (b,-; — 1)*, and hence, using 
(1), 


I+1 I 
(b,-1-1 — I) + a En—i = bp + x Eni 


l 
SMOne DE 2 nse SP lee 


(The double asterisk is used throughout the paper to denote the end of 
a proof.) 
We now prove 


Theorem 3: U(m) # 0 implies that me N®, a, divides m; and, for 
[=1,...,R, 


M~ 


i=1 


l 
Mp-it1 SY ag! ap-i4j(mo9 + j — 1). 
ji 
Proof: To have a nonzero probability that B, = m, it is immediate that 
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me N®. Also, 0) = ayxn—1 = mp, and for a nonzero probability, x,—1 
must take on an integer value. 
Using Proposition 2, we have for! =1,...,k, 


l lek l 
be ri Le Ss gn-i = AjXn-i-j 2 ao pe comers 
i=1 i=1 j=0 = 
Therefore, 
l 
2 Mees ag (bn aed) (8) 
t=1 
Now 
. l k 
One ye Dd Oj Xntk-i-j 
=1 (3) jek 
R l 


oe Oj Xn+k-i-j. 
jek—l41 i=k+1-j 
Hence, if we make the substitutions ] = 7+k —landi =a0+l1-—7, we 


obtain 


l : l T 
SOFA SY apy ( 3 Sno) 
1=1 o=1 


ae Gent = 1); 
T=1 
using (8). 
So, if 0° = m, and b, = mo, the m,’s must satisfy these conditions. .. 


Corollary 4: For fixed mo and k, there can only be a finite number of m 
such that U(m) =~ 0. 

Such m that satisfy the criteria of Theorem 3 will be called proper 
states. From (7), each x; then is the sum over only a finite number of 
these. 

To derive the fundamental relation satisfied by U(m) we need 


Proposition 5: 0%) = aXn + 0,7) forr =1,...,k — Land 0%, = 


ApXn- 
Proof: From (4), forr =1,...,k — 1, 


k 
r —_— 
On} = AiXn+r-i 
i=r 
k 
= ApXn + Ds QiXn+r+1-i-1 
i=r+1 


= a-Xn + ae 


and 6), = a,x, by definition. ,. 
We now define a map from Z! into itself called T,, where y is a non- 
negative integer: 


T,(m) = R(m) + (7, — (y — 1)*,0,...), 


STEADY-STATE PROBABILITIES OF BUFFER 3101 


where R is the right shift operator. More explicitly, we can write 

Ty (m) = (y,mo — (y — 1)t,m,, . . .). (9) 
Theorem 6: 

U(m) = p(o) © y=0 U(T,(m — ovo)), 


where o = aj; !mp,p(a) = Pr(xn = 0) and vp = (ag, ... ,an,0,...). 
Proof: By Proposition 5, 0“), = mz implies x, = a; 'mz = o. If o is not 
a nonnegative integer, then p(c) = 0 and U(m) = 0, from (5) and (6), and 
the equation holds trivially. 

Now we let o be a nonnegative integer. Recall from Proposition 5 again 
that 


OF) = 9) — a,pxn, forr=1,...,k—1. 
Also, from (1), (2) and (4), we have 
g) = £, — aoXn = Ona (bn = 1)t — AoXn. 


So, if b, = y and By41 = (mo, ...,mz,0,...), it will be necessary and 
sufficient, from (5), that x, = o and 


Bn = (y,mo — (y — 1)* — aoo,mi — ayo, ... ,Me-1 — Ap-10,0, . . .). 


In more compact notation, for me N* we have B,,41 = m, b, = y iff Bn 
= T,(m — ovo), Xn = o. 

But x, is independent of b, and hence, from (4) and (5), of By. 
Therefore, 


Pr(Bn41 = m) = 2 Pr(By 41 =n m,b, =) 
72 


= Pr(x, =o) | Pr(B, = T,(m — ovo)). 
720 


The theorem follows by letting n — © and using (6). ,. 

The fundamental relation in Theorem 6 satisifed by U(m) was stated 
by Gopinath and Morrison,! in less compact notation. They also 
showed!? that, once the steady-state probabilities U(m) with mo = 0, 
corresponding to an empty buffer, were obtained, then the steady-state 
generating function for the buffer content could be calculated in terms 
of the generating functions for some marginal distributions. In this paper, 
we show how the quantities U(m) may be calculated for any value of mo, 
so that the steady-state probability that the buffer content is 1 may be 
calculated with the help of (7). In fact, we show how to iterate the fun- 
damental relation in Theorem 6 so as to obtain a finite system of linear 
equations for calculating U(m) for a fixed value of mo. This procedure 
was alluded to by Gopinath and Morrison! in the case mo = 0. 


ill, REDUCTION FORMULAS 


We first remark that, if m, = 0 then the summation in the funda- 
mental relation for U(m) in Theorem 6 includes the term corresponding 
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to y = mo + 1, so that this does not give a closed system of equations for 
U(m) for a given value of mo. To carry out the desired iteration of the 
fundamental relation, it is convenient to define some new quantities. 
Accordingly, we let 


Um) = Y U(T,(Gn)), (10) 
y20 
and, forr =1,...,k — 1, define 


U'+D(m) = UM (T,Gn)). (11) 
1 


y2 


Note that the summation starts at y = 0 in (10), but at y = Lin (11). In 
terms of the definition in (10), Theorem 6 may be restated as 


Theorem 6’: 
U(m) = p(c)UM(m — ovo), 


where o = a; ‘mp, p(o) = Pr(xpn = 0) and vo = (ag, ... ,ap,0,...). 
The Us are intimately related to the U’s, and analogous statements 
can be made about them. 


Theorem 7: U(m) # 0 implies, fork = 1 andr = 1,...k, that 
mé N*~’ and, fork = 2andr=1,...,k — 1, that a, divides mp_, and, 
forl=1,...,Rk—Yr, 

l l 


a Mpa S ap epg (mg TF yl) 
i= J=1 


Proof: Use induction on r. 

(r = 1) From (10), U“(m) # 0 implies that U(T,(m)) # 0 for 
some y = 0. By Theorem 8, T,(m)e N* and a, divides (T,(m))x, for 
some y = 0. Hence, from (9), me N*®~! and, for R= 2, a, divides mp-1. 
Using the inequalities in Theorem 3 on T',(m), we have 


E (Py (mnie SY adton-tejly 4 ~D. 
for/=1,....k. This translates into 
S Mp-i S > ao. ogy by); 
for/=1,...,k —1,and 


k k 

Sy ety = t= ya, of 47 = 1), 

l= jJ=1 

Since (T',(m))1 = mo — (y — 1)*, we must have y < mo + 1 in order for 
T,(m)e N*. It is necessary that the m,’s satisfy the above inequalities 
for the largest possible y, so we let y = mp + 1. Then 
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l l 
2 Meni = x ag aR—14+j(Mo + J), 
i= j= 
for! =1,...,k —1, and the other inequality is redundant, being trivial 
for k = 1, and implied for k = 2 by the inequality for! =k — 1. 
(r—>r+1) We consider r < k — 2, fork = 3. From (11), U’*+)(m) 
~ O implies that U(T,(m)) # 0 for some y = 1, so that T,(m)e N*-" 
and a, divides (T,(m)),—,-. Hence, from (9), me N*-"—! and a, divides 
Mr-r-1. Also, for some y = 1, 


l I 
au (T,(m))p-i-r41 S du ag Op (y +r +j — 1), 
i= j= 


for! =1,...,k —r. As before, y < mo +1, and for! =1,...k —r—lwe 
obtain the inequalities 


l I 
aX Mp-i-r S xX a9 QR-14j(Mo +r + j). 
i= J= 

As before, the inequality for 1 = k — r is redundant. 

It follows from the above that U“-)(m) + 0 implies that me N!, for 
k = 2. Hence, from (11), U‘)(m) # 0, for k = 2, implies that T,(m)e N! 
for some y = 1, so that, from (9), me N°. But we have already shown for 
k =1that U(m) # 0 implies that me N°. Hence, U“*)(m) # 0 implies 
that me N° fork = 1. x. 


Corollary 8: For fixed mo,k,r, there is only a finite number of m such 
that U“)(m) # 0. Moreover, each sum that defines each U“) is fi- 
nite. 
Proof: The first assertion is clear. For the second, we use (10) and (11) 
and the fact that (T,(m))1 = mo — (vy — 1).7 + 

Before we derive the relations for U“”)(m) corresponding to Theorem 


6’, we need some more definitions. For r = 0,...,k we define 
r 
Mr a L Qi, (12) 
i=0 


and, forr =0,...,k —1, © 
Vr a (pp, Ores es foe ,an,0, vs a) (13) 
We will make use of 


Proposition 9: T, has the properties: 
(t) Ty(m + m’) = T,(m) + R(im’). 
(it) For integers y = land y’ = 0, 


Pye mM) = Ts) Fy’, = y0ia 2): 
(iit) For integers y = land «= 0,andr=1,...,k — 2, 


Ty+ou,(m) — ov, = T(m — ov,+3). 
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Proof: (i) and (it) follow directly from (9). Also, using (iz), for integers 
y = land o = 0, we have 


Ty+5u,(m) rE T,(m) + (our, ~ our, 0, ae .) —~ Ov, 
= T.,(m) — o(0, us, + Or+1,A%r4+2,~..- ,ap,0, se .) 
= T.,(m) _ R(ov,+1) = T,(m = oVr41)- * 


For k = 2,r =0,...,k — 2, and o a nonnegative integer, we define 
S,(mj;o) a T oy,(m) — OVr. (14) 
Also, fork = 2,r=1,...,k ~lands =1,...,r, and aj,!mz-, a non- 


negative integer, we define V{S)(m): 


Vm) = 3 US)(S,_(T,,0...OT,,_,(an);a;,mp-;)), 


Yl>- ees Yr—-s21 


if s ~ r, where O denotes composition of the operators, and 


V(m) = U(S,_1(m;az!mp-—,)). (15) 
Lemma 10: V{?,Qm) = 21 VX(T,Qm)) fork 2 3,r=1,...,k —2and 
s=1,...,r,and a;! (T,(m)),-, a nonnegative integer. 


Proof: We will only consider the case r # s. The proof for r = s requires 
only a slight modification. From (15), 


~ VY(Ty(m)) 


y21 
=> by US)(S,-1(T,, O...0T,,_, OT,(m);0)), 
y21 y1,..-,Yr-s21 


where o = aj '(T,(m)),—,. However, from (9), (Ty(m))z—- = Mp—r—1 for 
k —r = 2. Therefore, o = aj '(T,(m)),—, = a, '(m),—-,~1, and if we let 
Y = Yr+1~-s, then the above expression is equal to V‘),(m). ., 
Theorem 11: Letk 2 2. Forr =1,...,k —1, we have the following for- 
mulas: 


Um) = p(o) | uo Ss vem) | 
s=l 


where o = aj,'(m),-- and o ¥ 0. If « = 0, then 
U“)(m) = p(0)[U"t Pam) + VY Gn)]. 


Proof: We use induction on r. 
(r = 1) Note here that the two cases coincide. From (10) and 
Theorem 6’, 


U®(m) = + U(T,(m)) = XY p(o)U(T,(m) — ovo), 
7y20 720 


where o = a;'(T,(m)),x. But (T,(m));, = mp1 for k = 2; therefore, o 
= aj,'mMp_, and so a is independent of y, and 
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U(m) = p(o) © U(T, Gm) — ov). 
720 


If o is not a nonnegative integer, then p(c) = 0 and U‘)(m) = 0, and the 
required result holds trivially. If o is a nonnegative integer, then we let 
g = Y — ouo and obtain 
UM (m) = p(o) a UD (To4649(m) — ovo). 
q2=—-on0 
The zeroth term of Ty+¢u9(m) — ovo is g, so, by Theorem 7, any terms 
where g < 0 vanish. Hence, 


UM (m) = p(c) | UP ono'm) — ovo) + UOT 5 4549(m) — om) | 


q21 
= plo) | Ue Solna) + 5 UOT, (m - on))| 
gqz21 


= p(o)[V\m) + U®)(m — 0). 


The last two steps follow from (11), (14), and (15), and Proposition 9, 
and the fact that o = a; 'mp,—1, to use the definition of V$". 

(r>r+1) We consider r < k — 2, fork = 3, and first assume that 
o = ay Mp-p—1 ¥ 0. But (T,(m))x-- = Mp-r—1, for r < k — 2. Hence, 
o= aj (T(m)) p+ ~ 0, and we may use our inductive hypothesis on 
U)(T.,(m)). From (11), since o is independent of y, we obtain 


U't(m) = p(s) © | oeencr am — avy) + > vin(rytan) |. 
y21 s=1 


If co ¥ Ois not a positive integer, then p(c) = 0 and U"'+!)(m) = 0, and 
the required result holds trivially. 
If o is positive, then, using Lemma 10, we have 


U’+)(m) = p(o) | > Ulrt)(T,(m) — ov,) + a Vien) |. 
y21 s=l1 
Also, if we let g = y — cu,, then 
y UCtD(T,(m) — ov) = XY UCT ot ou, (m) — ov). 
y21 q21—opr 


But o and yu, are positive integers, so ou, = 1. Hence, by asimilar argu- 
ment to the case r = 1, 


U’+)(T,(m) i ov,) = es UO To 4 e4,(m) — ovr) 


y21 q20 


= U"t)(T,,,,(m) — ov,) +  UTtHY(T (am — ov,41)) 


q21 
= Vian) + UCH2%) (mM — ov,+41), 


where we have used (11), (14), and (15), and Proposition 9. Conse- 
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quently, 
r+1 
U’t+)(m) = p(c) | Um 094) 5 ee veestam) |, 
s=1 


where o = aj, !Mp—;—1. 

Finally, we consider the case o = 0, so that (T,(m)),-, = Mp—p-1 = 
0. Then, using (11) and the inductive hypothesis on U“)(T,(m)), we 
have 


Ult+)(m) = p(0) © [Ult(T,(m)) + VPC Gm))] 
y21 


= p(0)(U"t?)(m) + VER (m)], 


from Lemma 10. .. 

Having derived the reduction formulas of Theorem 11, we now com- 
ment on the quantities V)(m) defined in (15), under the assumption 
that a; ‘mp—, is a nonnegative integer. It may be verified, from the def- 
initions in (9), (13), and (14), that, for Rk = 2 and r = 1,..., 
k — 1,S,—1(m;a;!m,_-)e N*-" implies that me N*-". Also, for k = 3, 
r=2,...,k —1,s = 1,...,r — 1, and positive integers y1,...,Yr-—s; 
S,-(T,,0... OT,,_.(m); a;!mp-,)e N*-* implies that me N*’-’. 
It follows, from Theorem 7, that, fork = 2,r=1,...,k - lands = 
1,...,r, V(n) # 0 implies that me N’~". Also, for r = 0,...,k — 1, 
we note that (m — ov,)e N*-"-1, where o = a; 'mz_, is a nonnegative 
integer, implies that me N’~". 


IV. THE STEADY-STATE PROBABILITIES 
We define the sets 


Q; = {U(m)|0 < mo <i, ma proper state}, (16) 


where the proper states satisy the criteria of Theorem 3. The sets Q; are 
finite, for fixed k, by Corollary 4. We will first show how to calculate the 
elements of Qo. Then, as shown by Gopinath and Morrison,!? the 
steady-state generating function for the buffer content can be calculated 
in terms of the generating functions for some marginal distributions. The 
marginals are finitely solvable, in the sense that a finite number of 
components of the marginal distributions can be solved for, from a finite 
number of linear equations. However, we will give an alternate method 
for calculating the steady-state probability that the buffer content is 1, 
which also involves a finite number of linear equations. In fact, by in- 


duction on 1, we show how to calculate the elements of 0;,1 = 1,2,..., 
and hence «1,... ,x;, from (7). 
We begin by defining the sets 
A; = {U(m)|0 < mp Si,1 Sr Sk, ma proper state}, (17) 


where the proper states satisfy the criteria of Theorem 7. The sets A; are 
finite, for fixed k, by Corollary 8. We also define the sets Aj, which are 
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obtained from A; by deleting the single element U‘*)(i,0,0, .. .), that 
is 


AG = Ap (O00) 3)}. (18) 


We will first show how to determine the elements of Aj, and thence the 
elements of Qo. We will make use of 


Lemma 12: Fork = 2,r=1,...,k —lands=1,...r, the quantities 
V‘) (m) are linear combinations of the elements of Aj. 

Proof: From (9), (18), and (14), it follows that (S,;_,;(mj;o))9 = 0. 
The result is then a consequence of the definitions in (15), (17), and 
(18)i.3% 

If k=1, then Ao contains the single element U“)(0,0,...), since 
me N&-" for a proper state, and hence the set Aj is empty. If k = 2, then 
Ag contains at least one element, namely, U‘*-)(0,0,...). (If k = 2, this 
might be the only element.) We now apply the reduction formula of 
Theorem 11 to each element of Aj ~ U'*-)(0,0, . . .). But for mp = 0 and 
o a positive integer, (m — ov,)o < 0, and hence U’+)(m — op,) = 0. 
Hence, from Lemma 12, we obtain a system of homogeneous linear 
equations which contain as unknowns only the elements of Aj. Note that 
we have omitted the reduction formula for U‘*-))(0,0, . . .). Since there 
is one more unknown than the number of equations, we can solve for the 
elements of Ag to within a multiplicative constant. 

We are now in a position to determine the elements of Qo. If k = 1, 
then, from the inequality in Theorem 3, 2 contains just the single ele- 
ment U(0,0,.. .) = xo, from (7), and xo is given by the formula in Prop- 
osition 1. If k = 2, then the elements of 0p are given by Theorem 6’ in 
terms of elements of Aj, since (m — ovo)o < 0 if mp = 0 and a is a non- 
negative integer, and U“)(m — ovo) = O if (m — ovo)o < 0. Hence, the el- 
ements of {9 are determined to within a multiplicative constant, which 
is determined by (7), in terms of xo. The elements of Aj are now also 
completely determined. 

We next turn our attention to the calculation of the elements of A} and 
Q;, fori = 1,2,.... First, however, we need 


Lemma 13: The assumption ppE (x) < limplies that p(0) > 0. 
Proof: E(x) = >o7-1 p(t) = 7-1 p(t) = 1 — p(0). But, from (12), since 
ag ~ 0 ¥ ax, it follows that u, = 2, and hence E(x) < %. ., 

We have shown how to determine the elements of Aj and Qo. We will 
show how to determine the elements of Aj and Q;, fori = 1,2,....We 
first consider the special case k=1, and use induction on 1. 


Theorem 14: For k = 1, if the elements of Aj and Q; are known, then the 
elements of Aji, ~ Aj and Q;41 ~ Q; may be determined. 

Proof: From (17) and (18), since k = 1 and me N#-" for a proper 
state, 


i+ _ Aj = {UY,0, o- .)}. 
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But, from Theorem 6’, 
UG,0,...) = p(O)UM(,0, ...). 


Since p(0) > 0, by Lemma 13, and U(i,0,. . .)e Q;, this equation deter- 
mines U((7,0,...). Also, from Theorem 6’, U(i + 1,m1,0, . . .) is deter- 
mined for m,; ~ 0, if there are any such elements in 2;41, since o > O and 
so (m — ovo)o) < 1 + 1. The remaining element of 0;4; ~ Q; is 
U(i + 1,0,...), since me N! for a proper state. But, from (9) and (10), 


O20. JS 0OGN eG GS) ois) 
y=0 


so that the remaining element is determined. ,.. 
We now consider the general case, and establish 
Theorem 15: Fork = 2, if the elements of Aj are known, then the ele- 
ments of Aji; ~ Aj may be determined. 
Proof: From Theorem 11, 


U®-DG,0,...) = p(0)[UG,0, ...) + V4G0,.. DI, 


which equation was omitted for 1 = 0. This equation determines 
U‘®)(7,0,...), by Lemmas 12 and 13, since U‘*-)(7,0, . . .)e At. Also, if 
mj =itlil<rs<k-lando=a;'mp-_, > 0, then U“(m) is deter- 
mined by Theorem 11, since (m — ov,)y <i + 1 for ¢ > 0. The remaining 
elements of At,,; ~ Aj are U'”)(m) with mp =i t+1,1<r<k—land m,_, 
= 0. 
But from (11), 
i 
Uk-DG +1,0,...) = U'®G0,...) -— SS UP-VDiyt — y +:1,0,...), 
yvy=1 

where the summation is absent if 1 = 0. This determines 
Utk-D(i_ + 1,0,...), and if k = 2 this is the only remaining element in Aj,, 
~ At. Ifk = 3, there still remain U“)(m) with mp =i +1,1sr<sk-2 
and m;_, = 0, and from Theorem 11, 


U(m) = p(0)[U°+Y(m) + VP (m)]. 


But we have just determined U“~)(i + 1,0,...), and so we know 
U®-DG + 1,m,,...) for m; 2 0. Hence, from the above equation, by 
Lemma 12, we may determine U‘*-2)(m) with mp =i + 1, and mz = 0. 
We then know U‘*-2)(m) with mp = 1 + 1 and mz = 0. By iteration of the 
above equation, we may determine any remaining elements of Aj.) ~ 
Aj. ss 
Lemma 16: Fork = 2, the elements of Q; are determined by elements 
OF AG fOr = 1,258 
Proof: The result follows from Theorem 6’. ,, 

We have shown that the elements of Aj and Q;, for: = 1,2,..., may 
be determined explicitly, once the elements of Aj, and Qo, are known. 
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The determination of the elements of Aj, however, involves the solution 
of a homogeneous system of linear equations. 


V. A PARTICULAR CASE 


We now confine our attention to the particular case &,, = x, + Xn—r, 
so that; from (2), 


ag =1l=apr, aj = 0 otherwise. (19) 


Weare interested in determining the number of proper states m of U(m), 
and also of U‘")(m), as defined by the criteria of Theorems 3 and 7. We 
show in the appendix that these criteria lead to a precise count of the 
number of nonzero U’s and U's when (19) holds, if p(i) > 9, 
i=0,1,2,.... 

We will make use of 


Lemma 17: For r = —1,0,1,...,ands = 1,2,..., the number of elements 
of ne N*-! which satisfy the conditions /=4n;j <r+lforl=1,...,s 
1s 
r+2s r+ 2s (r+ 2)(r + 2s + 1)! 
ee a ee 
s—-2 sl(r+s+ 2)! 


Proof: We use induction on s. 

(s = 1) The number of np with O < no Sr +1isclearly r+ 2 = 
F(r,1). 

(s >s +1) Now \/idn; <r+/forl =1,...,s + 1 implies that 
no <rt+iand ¥/jn; <r+l+1-—noforl =1,...,s. Hence, 


=Fi(r,s). (20) 


r+1 r+1 r+1 
Pirst+1)= Y P(r+1—ngs) = © Pls) = YL Fs), 
i=0 i=0 


no=0 


from the inductive hypothesis. But, as may be verified, 


F(i,s) = F@ — 1,s+ 1) — Fi — 2,5 + 1). (21) 
Hence, 
r+1 
> Fii,s) = F(r,s + 1), (22) 
i=0 


since F(—2,s + 1) = 0. .. 


Corollary 18: For fixed mo and k, the number of proper states m of 
U(m) is F(mg — 1,k), and the number of proper states m of U)(m) is 
F(mp +r—1,k —1), forr=1,...,R. 
Proof: The results follow from (19), Theorems 3 and 7, and Lemma 17. 
Note that, for r = k, the only proper state of U‘*)(m) is (mo,0, . . .), since 
me N°, and we have F(mo + k — 1,0) = 1. 4 

From (16) and Corollary 18, it follows that the number of elements 
of Qo is 
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(2k)! 


Qo| = F(-1,k) = ———_—_.. 23 
1g) PE) Rk + 1)! ee 
Also, the number of elements of 2; ~ Mp is 
|;| — |@] = © Fm —1,k) = FG — 2k + 1) 
mo=l 
(i +2 1)! 
i(i + 2k +1) (24) 





~ EPINGER EAU 
from (20) and (22). From (17) and Corollary 18, the number of elements 
of A; is 


[A;| = > hier: (25) 


mo=0 r=1 
But 


(Pa 2) Ie yh 
(r+s + 2)!s! 


Therefore, from (21) and (26), we have 
F(mop +r — 1k -—r) = —F(—(mo + r+ 3),mo + k + 1) 





F(-(r+ 4)r+s+2)= —Fi(r,s). (26) 


= —|[F(-—(mo Prt 4),mo + k+ 2) — F(—(mo ee Ae 5),mo +k+ 2)]. 


Hence, if we sum and use (26), we obtain 


M= 


F(mp +r — 1,k —1r) 


1 
= —[F(—(my + 5),mp + k + 2) — F(—(myp +k + 5),mp + k + 2)] 
= F(mo + 1,k — 1) — F(mp +k + 1,-1) = F(mp + 1,k-1). (27) 
From (21) and (25), it follows that 
|A;| = F(@,k) — F(-1,R). (28) 
Note, from (23), (24), and (28), that 
\Q;) + |A;| = FG -2,.k +1) + FUR) = FG - 1,k + 1), 


from (21). 

Of particular interest, for k = 2, is the number of equations required 
to determine the elements of Aj to within a multiplicative constant, 
namely | A$] —1 = | Ao|—2, from (18). But, from (21) and (28), 

3(2k)! 
(k-—1)'"(k +2)! 


r 


[Ao] = F(0,k) — F(-1L,k) = F(Lk - 1) = 


STEADY-STATE PROBABILITIES OF BUFFER 3111 


The first few values of | Ao| — 2 and |ol, as given by (23), are 


k 2 3 4 5 6 
| Ao] — 2 1 7 26 88 295 
| Qo| 2 5 14 42 132 
We also have the asymptotic result 
Aol — 2 
Gea AOE 
ko |Qo| 


Vi. AN EXPLICIT EXAMPLE 


We here consider the example corresponding to k = 4 in (19), so that 
En = Xn + xXn-4. We will explicitly determine the elements of 0p for this 
example. As discussed in Section IV, the reduction formula of Theorem 
11 is applied to each element of Aj ~ U‘)(0,0, .. .). Then the elements 
of Qo are determined with the help of Theorem 6’ and the normalization 
condition (7) with i = 0. 

From (17), (18), and Theorem 7, the elements of Aj, with an obvious 
change of notation, are 


Up momy M3 S1,m2+ m3 S 2,m, + m2+ mz S38, (29) 
US ims M2 S2,m,+ me S 3, (30) 

and 
US, mS 3, (31) 


where m, m2 and m3 are nonnegative integers. From (16) and Theorem 
3, the elements of Qo are 


Oomimsms0s M3 < 1,mo + m3 < 2,M1 + mo + M3 < 3. (32) 


But from Theorem 6’, again with an obvious change of notation, 


Uomymom30 = POU Winans . (33) 
From (7), the normalization condition is 
ko = >» Um nsie0 = Po eo) eer (34) 


where the summations are over the range of subscripts satisfying the 
inequalities in (29) and (32). 

We now apply the reduction formula of Theorem 11 to each element 
of Aj ~ US}, and note, from (12) and (19), that 


wy = 1, r = 0,1,2,3. (35) 
From (9), (13), and (14), with m = (mo,m1,m2,m3,0, .. .), we have 


mM — M3r1 = (Mo — M3,M1,Mzo,0, .. .), (36) 
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and 
So(m;m3) = (0,mo — (mg — 1)*,m4,Mo,0,...). 
Hence, from (15) and (37), 
| VP(m) = Uso (mg-bymima 
It follows from Theorem 11 that 


UO imso = PUP ms of UP Primads 
and 


UG asi = PO nae ’ 


: 2 ne 
since U@) ns ms = 0. 
Similarly, with m = (mo,m1,mo,0,.. .), 


Mm — Moro = (mo = mo,m,0, a <)5 
Si(m;m2) = (0,mo (mg = 1)*,m4,0, os Je 


and 


So(Ty,(m);m2) = (O,¥1 — (m2 — 1)*,mo — (41 — 1)t,m4,0,.. 


Hence, from (15), 
VS(m) = Co naeil tins 
and 
VP(m) = LUN} s-(m2-1)+,mo-1- D+ mt 
yizl 
It follows from Theorem 11 that 
USi.0 = PoUsh, + Ulm)» 
and, for my ¥ 0, 
Udine = Pmo (U0) mg~iys int + UD -Gno-1)+ 0,1) 
Hence, 
Usinut = PU Sm + Ubioms) 
and 
Udiny2 = P2U Som: 
Next, with m = (mo,m 1,0, .. .), 
Mm — Myp3 = (Mop — M),0,...), 


So(m;m) = (0,mo 7 (my a 1)*,0, oe aS 


S1(7T,,(m);m1) = (0,71 ~ (my — 1)+,mo ~~ (y1 > 1)*,0, o yy 


and 


a 
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(37) 


(38) 


(39) 


(40) 


(41) 


(42) 


(43) 


(44) 


(45) 


(46) 


(47) 


(48) 


(49) 


(50) 


(51) 
(52) 
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So(Ty,(Ty.(m));m1) 
=Ojc— ip) 7 tS De ye 0 es) 
Hence, from (15), | 
V{)(m) a LS), eat ) 


V2(m) = os UP —(mi—-1)+,mo-(y1-1)+ » 
yi21 


and 
1 = 1 
VS (m) = ye OS) pits Getter seiis : 
YLy221 


It follows from Theorem 11 that, for m; ¥ 0, 


U$>, a Pm, (UP mn — USl-om-+0 


1 
a » Ut -tm-vet-en-v+0) ‘ 
yizl1 


(53) 


(54) 


(55) 


(56) 


(57) 


We now write out in full the nontrivial equations corresponding to (39), 
(40), (46), (48), (49), and (57), omitting terms which are identically zero. 


From (39) we have 
Udsoo = Po(Usoo + Ubooo) Ussio = Po(US + Usion), 
Usooo = PoU soe, Ubsibo = Po(US + Ubtio) 
Usvio = Po(USA + Ui), — Ubtso = poh, 
Ussho = Po(Uss + Ustro), Usto = PoU bs, 
Ussbo = = poU bio, 
and from (40) we have 
Usto1 = PiU boo; Usti = PiU soon, Usior = PiU soho, 
Us = piU iu Ushi = PiU sho. 
Next, from (46) we have 
Uso = po(US) + Ubivo), Usth = po(US? + Ustor) 
Ussh = PoU§?, Usso = PoUS?, 
from (48) we have 
Uso = PiU + Ubioo), 
Ut = pi(US, + Usion), US, = PiU bob, 
and from (49) we have 


Ug = po hhro: US) = po2U soo. 
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(58) 


(59) 


(60) 


(61) 


(62) 


Finally, from (57) we have 
US$? = pr(USy + Ubih + USio + Ubsoo), 
US$} = pol US + Ustio + Ubroo), Us} = p3U hobo. (63) 
We may eliminate the 13 nonzero quantities U. Pes and U$}, from 
(58) to (63), and solve for the 14 nonzero quantities U))), moms to within 
a multiplicative constant. It is found that 
Udo = a0, Ustor = P10, Usb az pido, 
Us = = pido, Usbdo= PoP 240, Ubsbo = = Pop3do, © 


Ubiro = Udsio = Ubto: = PoP 1P240; (64) 
and 
Uspio = PiAao, Usto: = piAao, 
Ustio = PIL + popi)Aao, — Ubtho = Popa(1 + pop1)Aao, 
Ustoo = Pill + palpi + pop2)(1 + pop1)] Ado, (65) 
where 
= {1 — popill + po*(p1? + Pop2)(1 + pop) it. (66) 


The constant ap is determined by the normalization condition (34). 
These results are consistent with those derived by a different 
method.? 


APPENDIX 


We show here that in the particular case corresponding to (19), the 
criteria of Theorems 3 and 7 lead to a precise count of the number of 
nonzero U’s and Us if p(i) > 0, i= 0,1,2,.... We first prove 


Theorem 19: If (19) holds, p(i)>0, i=0,1,2,..., me N*®and 


M-~ 


Mp—-i41 SMa tl—1, [=1,...,R, (67) 


i=] 


then U(m) = 0. 
Proof: From (4), (5), and (19), it follows that 


B, = (On: Xn—bs tee nH 1,9; o «)e (68) 


It was shown? that the irreducible Markov chain, with state space con- 
sisting of those states which communicate with (0,0, . . .), is positive re- 
current. Moreover, it was also shown that, in the present notation, the 
state (19,0, ...) communicates with the state (0,0,...), where ig is a 
positive integer. Hence, with probability 1, the state (i9,0,.. .) occurs 


STEADY-STATE PROBABILITIES OF BUFFER 3115 


infinitely often. We now assume that B,~, is given by 


k 
bn-k = Mo tk —- 2 Mp-i+1, Xn—2k+r—1 = 0, dak ee 
t= 
(69) 
Also, with positive probability, since p(t) > 0, 1 =0,1,2,..., 


Xn—k+r—-1 = My, r=1,...,kR. (70) 


We will show that (67), (69), and (70) imply that b, = mo, and hence, 
from (6), that U(m) + 0. 
We first show, by induction, that 
k-r 
Dn-ktr = Mo tk—-r—- DY me-i4i 21, r=0,...,R—1. (71) 
i=l 
This is true for r = 0, from (67) and (69). 
(r >r+t+1) We consider r = 0,...,k — 2, fork = 2. Since £, = 
Xn + Xn—p, it follows from (1), (69), and (70) that 


bn—-k+r+1 es (bn—k+r = Wa + M41 


= bn-ktr — 1+ mrt 


k—-r-1 
=motk—(rt1)— YX mein 2], (72) 
i=1 
from (67). This completes the inductive proof of (71). Finally, with the 
help of (71), we obtain 


bn a (bn—-1 +3 1)* + Mp = Dnat Bere ex Mk = Mo. «x 


We now prove 
Theorem 20: Suppose that (19) holds and p(t) >0, 1=0,1,2,....Also 
suppose, fork =>1landr=1,...,k,that me N*-"and, fork = 2and 
r=1,...,k —1, that 


! 
> Mp-j-r+41 {Mo trt+1l1—1, [=1,...,R—r. (73) 
i=l 
Then U“)’(m) + 0. 
Proof: Use induction on r. We note, from (6), (10), and (11), that U‘)(m) 
20;. P= Tyceesk. 
(r=1) Let 


th = Tmo+1(m) = (mo ae 1,0,m,, an 5 (74) 


from (9). We will show that U(m) + 0, which implies that U“)(m) = 0, 
from (10). If k = 1, then mh = (mo + 1,0,0,...), hence rh € N! and 
0=m, < Mo = mo +1. It follows from Theorem 19 that U(mh) + 0. Ifk 
> 2, then mm € N* sincem e€ N*-}, and, from (73), 
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l 
> Mp-it41 S Mo +1 —1, b= do. 4k — 1, 
i=1 


k k~1 
> Mre-i41 = DS Mp-i41 S Mo +k -—2SmMotk-1. 
i=1 i=l 
It follows from Theorem 19 that U(mh) + 0. 
(r-1—r) We consider r = 2,...,k — 1, fork = 3. Then, from 
(74),m e€ N*-' implies that m € N*-'+!, Also, from (73), 


l 
> Mp-i-rgo = Mo trtl— 2, l=1,...,k—f7, 
i=] 


k-rt+1 kor 

Pp Mp—j—r+2 = 2X Mrpi-r42. {Mg tk -2SmMot+k—-1. 

i= i= 
It follows from the inductive hypothesis that U’-(h) ¥ 0. Hence, from 
(11), U™(m) + 0. 

(k) m €N®, for k = 2, implies that th Ee N! and 0 = m, S 

motk—-1=motk. Hence, U“’-(th) ¥ 0 and, from (11), U“)(m) + 
Ona 
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(Manuscript received March 28, 1978) 


Technological advances are continually increasing the economic 
viability of efficient codecs in telephone networks. A subjective eval- 
uatton is described here of the 1255 Pulse Code Modulation (PCM) al- 
gorithm and three more efficient techniques, Nearly Instantaneous 
Companding PCM (NIC PCM), Cummiskey-Jayant-Flanagan Adaptive 
Differential PCM (ADPCM), and Subscriber Loop Carrier Adaptive 
Delta Modulation (SLC* ADM). These codecs are compared under the 
conditions of: (i) single encodings as a function of line bit rate, input 
level, received volume, and error rate, (ii) tandem encodings with in- 
termediate baseband conversion, and (iit) local, exchange, and toll 
network reference connections where mixed tandem encodings might 
be found along with typical analog impairments such as loss and ran- 
dom noise. The simulation of the codec algorithms on a minicomputer 
facility enabled the production of subjective test tapes containing 
speech processed under these conditions. These tapes were then eval- 
uated in listening-type subjective tests. It is shown that (i) NIC PCM, 
ADPCM, and SLC ADM have approximately a 12- to 16-kb/s advantage 
over 4255 PCM for equivalent subjective ratings, (i1) NIC PCM, ADPCM, 
and SLC ADM perform comparably over the range of conditions tested; 
and (iit) 64-kb/s 4255 PCM can be deployed in a multiple encoding 
environment with very few restrictions, whereas the use of lower bit rate 
NIC PCM, ADPCM, and SLC ADM codecs would necessitate more strin- 
gent application rules to avoid excessive degradation in tandem en- 
coding situations. 


* Trademark of Western Electric. 
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Il. INTRODUCTION 


In recent years, a great amount of interest has been expressed in the 
literature on the subject of efficient encoding of voiceband signals. Much 
of this interest stems from the economics of bandwidth reduction that 
are possible with efficient codecs. Waveform codecs such as differential 
PCM (DPCM), adaptive differential PCM (ADPCM), delta modulation 
(DM), and adaptive delta modulation (ADM) are considered for general 
use in telephone networks because, among other reasons, they often are 
a reasonable compromise between the bandwidth required of the 
transmission channel and the terminal complexity at the ends of the 
channel. 

Aside from the economics, there is the issue of degradations introduced 
by a codec in speech and voiceband signals. In fact, a signal may undergo 
a number of encodings by codecs of different types as it progresses 
through the network. An example of this is the hypothetical network of 
Fig. 1. Here, a mixture of analog and digital switching and transmission 
facilities are represented. 

The toll portion of the network incorporates analog toll switches such 
as the No. 4 crossbar and the digital No. 4 ESS switch. The D channel 
banks and VIF terminals are shown in order to point out where analog- 





CENTRAL A/DESD/DR ee eae ae i ae 
OFFICE 


ANALOG FORMAT 

= = DIGITAL FORMAT 

D = D CHANNEL BANK 

V = VOICEBAND INTERFACE FRAME (VIF) 


Fig. 1—Possible codec deployment in DDD network. 
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to-digital conversions take place. The coding law currently used in Bell 
System D channel banks is 64-kb/s 1255 PCM.! 

The exchange network environment favors the deployment of efficient 
codecs because of the investment in the exchange and loop plants. 
Subscriber carrier and remote switching systems may be installed on 
loops and interfaced directly to a digital central office. Other carrier 
systems may be implemented by a digital-to-digital (D/D) conversion 
from one code format to another more efficient format as shown in the 
exchange trunk example of Fig. 1. 

The point is that a telephone call can be routed over a variety of con- 
nections in this network and might be subjected to 64-kb/s 4255 PCM 
encodings in the toll plant in tandem with other codecs found in the local 
environment. The goal of the studies reported in this paper is to gain 
insight into the subjective effects of a multiple encoding environment 
on speech. Another important component of this goal is the impact of 
this environment on voiceband data. However, the resources and time 
consumed by the speech studies did not allow simultaneous work on 
voiceband data in the context of this study. 

Prior to the evaluation of the overall subjective effect of a network - 
environment on speech, the contributions of the various components 
of the network must be well understood. The analog components can 
be represented by transmission impairments such as loss and additive 
random noise.? The performance of a digital component of a connection 
is a function of the quantizing noise characteristics of the digital en- 
coding, the line bit rate, the speech input level to the codec, and trans- 
mission impairments such as bit errors, slips, and misframes. The initial 
step of the study concentrated on the evaluation of some of these im- 
pairments for single encodings by selected codecs. Next, the codecs were 
tandemed with themselves under controlled conditions in order to es- 
tablish the multiple encoding behavior without the complications of more 
than one coding law. Finally, selected codecs were incorporated into a 
set of reference network connections. These reference connections are 
representative of Bell System local and toll connections with charac- 
teristics (loss, noise, talker volume) that are derived from survey 
data. 

The purpose of this paper is to report on recently completed subjective 
tests of digital codecs. Detailed results of the tests are presented and 
preliminary analyses are discussed. However, another objective of these 
tests is to provide a sufficiently large data base to enable the development 
of analytic models of subjective behavior. These models would be used 
to predict the performance resulting from the introduction of digital 
codecs in an evolving telephone network. 
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ll. CODEC ALGORITHMS 


Four basic codec algorithms were evaluated: (1) «255 PCM,?4 
(it) Nearly Instantaneous Companding (NIC) PCM,? (iii) Cummiskey- 
Jayant-Flanagan ADPCM,® and (iv) Subscriber Loop Carrier (SLC*) 
ADM.’ These four were chosen for one or more of the following reasons: 
interest in the algorithm, availability of a well-defined solftware algo- 
rithm in the time frame of these tests, total number of test conditions 
to be generated, and the existence of a hardware implementation of the 
codec. These codecs are samples of four different classes of waveform 
codecs, but each chosen codec is quite specific and the reader is cautioned 
against generalizing the results for a particular codec to an entire class 
of codecs. Note that NIC PCM, ADPCM, and SLC ADM employ adaptive 
quantization (or companding) but none of the codecs incorporates 
adaptive prediction. 

A brief description of some of the characteristics of each algorithm 
is given here which, in conjunction with the references, should aid the 
reader in understanding the similarities and differences among the al- 
gorithms when the results are discussed. 


2.1 4255 PCM 


Three versions of the 4255 PCM algorithm were included in the tests: 
the continuous law compandor® with a mid-tread bias and the 15- 
segment version‘ of the 4255 compandor with both a mid-tread and a 
mid-riser bias. However, most of the attention is focused on the 15- 
segment mid-tread algorithm, since this algorithm at 64 kb/s is used in 
Bell System D channel banks.? Since the idle channel noise of a mid- 
tread algorithm implemented on a computer is essentially nonexistent, 
16 dBrnC0 of random noise is introduced at the outputs of both mid- 
tread algorithms. This level of noise is intended to represent that which 
could be achieved in a hardware implementation. The overload point 
in all three cases is set at the peak amplitude of an inband sine wave with 
an rms power of +3 dBm0. 


2.2 NIC 


This algorithm is a block encoding scheme® which compresses L -bit, 
15-segment 4255 PCM (L 2 4) to L — 2 bits. In this application, there are 
eight samples to a block. The largest segment number in the block is 
found and transmitted to the far-end decoder. The eight L-bit 4255 
samples are then digitally reencoded into L — 2 bit uniform samples with 
an overload point at the top of the largest segment in the block. This 
yields a bit rate of [8 X (L — 2) + 3] kb/s for an 8-kHz sampling rate, 
where the 3-kb/s component represents the transmission of the maxi- 


* Trademark of Western Electric. 
t Except for older D1 channel banks, where 56-kb/s 1100 PCM is used. 
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mum segment number in the block every millisecond. The NIC algorithm 
is biased mid-tread for L = 7 bits and mid-riser for L <6 bits. As in the 
PCM algorithms described above, 16 dBrnC0 of noise is added to the 
decoder output during mid-tread operation, and the overload point is 
set at +3 dBm0. 


2.3 ADPCM 


The Cummiskey-Jayant-Flanagan ADPCM algorithm® was chosen 
using a first-order predictor in the feedback loop with a time constant 
of 0.43 ms. For line bit rates greater than or equal to 32 kb/s (L 2 4 
bits/sample at 8-kHz sampling), the minimum quantizer step size is 
equal to twice the interval on the first chord of the 8-bit 4255 PCM 
quantizer. For L < 4, the minimum step-size is increased by the factor 
(5 — L) over the step-size for L = 4. For all values of L, the ratio of the 
maximum step-size to the minimum step-size is 128, identical to the 8-bit 
4255 PCM algorithm. No amplitude overload point was set for this al- 
gorithm, since it is a differential codec, but for L = 4 the algorithm is 
driven into slope overload with a 425-Hz sine wave at +6 dBm0. A 
modification of this algorithm was also investigated to determine the 
subjective effect of line bit errors. This modification was the introduction 
of step-size leak into the step-size adaptation logic. Normally, 


A; = a; A;-1, 


where A; is the quantizer step-size at a particular sample interval i and 
a; is the corresponding adaptation coefficient. An error in the trans- 
mission of the bit stream to the decoder will cause an error in the decoder 
step-size adaptation. The effect of bit errors can be minimized some- 
what by the addition of step-size leak, so that 


A; = a;(Aj-1)y, 


where y is less than but nearly equal to unity. In our application, y is 
chosen to be 31/32. This limits the effect of an error at the decoder so that 
the encoder and decoder step-sizes eventually track each other. However, 
the adaptation process remains affected by the. step-size leak in the ab- 
sence of errors, and discernible distortion may be introduced into 
the speech. 


2.4 SLC ADM 


The ADM algorithm chosen was the SLC-40 algorithm.’ It is currently 
deployed at a sampling rate of 37.7 kHz in a 40-channel loop carrier 
system. Of the three non-PCM coding schemes described in this section, 
it is the only one in commercial use in the Bell System, with approxi- 
mately 1800 systems installed in the field to date. This algorithm uses 
an adaptive step-size where the step-size is altered whenever four suc- 
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cessive like sign bits are transmitted on the line. The predictor in the 
feedback loop has a main time constant of 0.7 ms. The minimum step- 
size is set to achieve an idle channel noise of 15.5 dBrnC0 at 37.7 kb/s. 
This codec is driven into slope overload with a +6 dBm0 input sine wave 
at a frequency of 800 Hz. 


lll. SIMULATION SYSTEM 


3.1 Overview of system 


A minicomputer facility which was developed at Bell Laboratories in 
Holmdel, N.J., is briefly described in this section. A detailed description 
of its capabilities and operation can be found in Ref. 8. 

This system has four important characteristics: (1) the system con- 
figuration is independent of any particular codec algorithm; (ii) the 
system is conducive to experimentation and development of a desired 
codec algorithm; (iii) the system is capable of simulating a network 
connection containing mixed tandem encodings of several codecs; and 
(tv) the system is capable of automatic production of audio tapes suitable 
for subjective evaluation. All the codec test conditions discussed in this 
paper were simulated on this system. 

Two PDP*-11 minicomputers are interfaced with uniform 15-bit A/D 
and D/A converters. The system is capable of real-time operation at 
sampling rates up to 72 kHz. A particular codec algorithm can be im- 
plemented on this system as long as the sampling rate of the codec is less 
than 72 kHz; the codec distortion dominates the 15-bit A/D and D/A 
converter distortion; and differences between hardware and software 
versions of the codec are recognized and accounted for. 


3.2 Hardware configuration 


Referring to Fig. 2, P>P 11/40 and 11/20 minicomputers are used, each 
having 28K words (16 bits/word) of memory. Each minicomputer is in- 
terfaced to a dedicated Tustin series 1500 A/D and D/A converter sys- 
tem. The A/D and D/A converter systems were measured and adjusted 
to ensure that they were capable of at least theoretical 14-bit accuracy 
so that the waveform codecs of Section II could be simulated. 

A 1.25-million-word disk system was installed on each machine with 
a nine-track tape drive interfaced on the 11/40 for mass storage. The tape 
format is standard, so that large amounts of data can be transferred to 
other more powerful computer facilities for processing. An Ampex AG- 
440G analog tape unit is also controlled by the 11/40 so that high-quality 
audio tapes can be automatically prepared for subjective testing. 

In addition to the dedicated equipment on each processor, there is a 
UNIBUS* window between the two machines to allow interprocessor 


* Registered trademark of Digital Equipment Corporation. 
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Fig. 2—Codec simulation facility. 
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communication and data transfer. This equipment is essential for the 
automatic generation of tandem encodings. 


3.3 Facility operation 


Referring to the lower left-hand portion of Fig. 2, a segment of an 
analog signal such as speech is derived from a tape recorder, telephone 
set, or other source. The signal is bandlimited, quantized, and stored on 
either tape or disk. The choice of tape or disk is determined from two 
considerations, the sampling rate of the A/D converter and the length 
of the analog segment to be digitized. The disk can store samples at rates 
up to 72 kHz, the magnetic tape up to 16 kHz. However, a 2400-foot tape 
reel can store 10 million words, the disk only 1.25 million words. 

After the digits are stored on either tape or disk, they can be processed 
by a codec algorithm and stored back on magnetic tape or disk in prep- 
aration for playback. The output signal of the D/A converter is passed 
through a reconstruction filter and the codec simulation is essentially 
completed. The resultant analog signal can be stored on an audio tape 
recorder as indicated by the “single encoding” path in Fig. 2. 

If a tandem encoding is to be simulated, the “switch” following the 
reconstruction filter is positioned to the input of the A/D converter of 
the 11/20. As the playback operation is proceeding on the 11/40, a si- 
multaneous acquisition process is proceeding at the 11/20. The D/A on 
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the 11/40 and the A/D on the 11/20 can be timed by independent clocks. 
Hence, the overall D/A-A/D process can be performed asynchronously. 
After the acquisition at the 11/20 is complete, the 11/40 retrieves the 
samples from the disk system of the 11/20 via the UNIBUS window. The 
samples then reside on the magnetic tape unit or disk system of the 11/40. 
A second codec processing can take place in an identical manner as de- 
scribed above. A tandem encoding loop is thus defined. This loop is il- 
lustrated in the upper portion of Fig. 2. A signal can be made to traverse 
this loop as many times as desired, with no restrictions on sampling rate 
or codec type for each successive encoding. When the desired number 
of tandem encodings are completed, the signal can be stored on audio 
tape as in the single encoding case. Both the single and tandem encoding 
operations are performed under the automatic control of the software. 
Manual intervention after each loop is not necessary. 


IV. SUBJECTIVE TESTING—TECHNIQUES 


The subjective tests described in this paper were all conducted as 
listening-only tests (not conversational). The subjects listened to pre- 
recorded speech and voted on the perceived quality. Details concerning 
the test facilities, selection of subjects, test circuitry, and test adminis- 
tration are covered in this section. 


4.1 Description of tests and facilities 


The subjective tests were conducted in an acoustically treated test 
room containing 11 cubicles permitting up to 11 subjects to be tested 
simultaneously. Each cubicle contains a handset over which test con- 
ditions are heard and a keyboard with five keys labeled “excellent,” 
“good,” “fair,” “poor,” and “unsatisfactory,” which is used for registering 
the vote for each test condition. Associated with the keyboard are red 
indicator lights which are lit during the presentation of a test condition 
and green indicator lights which are lit to indicate the period for voting 
on the test condition. 

The votes of each subject are recorded using a minicomputer system, 
a keyboard interface, and associated programs. A terminal permits 
monitoring of the ratings during the tests. All votes are recorded on 
magnetic tape for subsequent analysis. 

At the start of the test session, a start signal is sent by the test ad- 
ministrator to the computer which then remotely actuates the tape re- 
corder and turns on the red keyboard lights. At the end of each test 
condition, a tone recorded on the second track of the test tape is recog- 
nized by a tone detection circuit which signals the computer to stop the 
recorder and turn on the green voting lights. The computer then collects 
the votes. After all the votes are received from the subjects or after a 
3-second timeout period (whichever occurs first), the computer extin- 
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guishes the green voting lights, turns on the red keyboard lights, and then 
starts the tape recorder for the next test condition. 


4.2 Test playback system 


The system used for the subjective tests is shown in Fig. 3. A dual-track 
tape recorder (Ampex 440G) equipped with a Dolby noise reduction unit 
is used to drive a standard 500-type telephone set (with a receiving rating 
efficiency of 21 dB) connected to 6 kft of 26-gauge, nonloaded cable and 
a 400-ohm, 48-Vdc feeding bridge (central office battery supply circuit). 
The carbon transmitter is replaced with a 90-ohm resistor to eliminate 
any room noise pickup. The master telephone set receiver is replaced 
by a 120-ohm resistor to avoid possible introduction of acoustic/inductive 
interference in the test conditions; a transformer-amplifier bridge on 
this resistor drives the 11 telephone set receivers in the cubicles as shown 
in Fig. 3. 

The measured responses of the playback system are shown in Figs. 
4 and 5. These responses reflect adjustment of the listening amplifier 
such that a speech level of —29 vU (volume units) at the line terminals 
of the telephone set (point V2 of Fig. 3) produces an acoustic pressure 
of —12 dBPa* (82 dB relative to 20 uPa) for speech power averaged across 
the 11 receivers. The value of —12 dBPa approximates the preferred 
speech pressure level. 


4.3 Subject selection and test administration 


Subjects were selected from employees in various job classifications 
and age groups at Bell Laboratories in Holmdel, N.J. The sheer number 
of test conditions dictated that the total test program be divided into 
five tests. All the tests were administered independently of one another 
using different subjects. The number of subjects and breakdown ac- 
cording to sex is given in Table I. Discussions of the testing throughout 
the remainder of this paper will be structured under the three topics of 
(i) single encodings, (11) tandem encodings, and (iz) the local, exchange, 
and toll reference connections. 

For each test session, a maximum of 11 test subjects were seated in 
the test cubicles and supplied with test instructions shown in Fig. 6. The 
test administrator read the instructions and told the subjects that there 
would be four practice test conditions given before the start of the test. 
The subjects were told to vote on the four practice conditions as if they 
were part of the actual test. After the practice conditions were presented 
and voted on, the test administrator would enter the test room to answer 
any questions before the actual testing began. 


* dBPa = dB relative to 1 Pascal which corresponds to 1 newton per square meter. 
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Fig. 3—Multiple listening playback system. 
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Fig. 4—Test signal levels. 
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Fig. 5—Response measurements for playback system. 
4.4 Speech sources and anchoring conditions 


The source tapes were made by recording male and female speech from 
the output of a 500-type telephone set carbon transmitter through a 111C 
repeater coil and a simulated 6-kft loop using a Dolby noise reduction 
unit. These sources were then processed by the various codec algorithms 
to produce the test conditions. 

Analog noise conditions were included in each test session along with 
codec conditions. The analog noise conditions were generated by adding 
white noise bandlimited from 200 Hz to 3.4 kHz to the speech so that, 
with the speech level set at —29 VU at the 500-type telephone set ter- 
minals, the noise is at a level of 10, 20, 30, or 40 dBrnC. The inclusion of 
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Table !—Number of subjects 


Test Total Male Female 


(i) =ingle Encodings—Variation of Line Bit Rates and 51 37 14 
Levels 
(it) Single Encodings—Error Rate 52 35 17 
(iii) Tandem Encodings plus Local and Exchange 53 49 4 
Reference Connections 
(iv) Toll Reference Connections—Part 1 56 46 10 
(v) Toll Reference Connections—Part 2 54 50 4 


THIS EXPERIMENT ITS DESIGNED TO STUDY THE EFFECTS OF VARLOUS 
IMPATRMENTS ON TELEPHONE ‘TRANSMISSION QUALITY. YOUR TASK 

IS TO LISTEN TO THE TEST CONDITIONS AND, AFTER EACH CONDITION, 
MAKE A JUDGMENT OF THE TTRANSMISSTON QUALITY. A JUDGMENT CAN 
BE MADE IN ONE OF FIVE CATEGORIES: EXCELLENT, GOOD, FAIR, 
POOR, AND UNSATISFACTORY. 


THE EXPERIMENT WILL CONSIST OF WO PARTS, EACH CONTAINING 
TEST CONDITIONS, WITH A REST PERIOD BETWEEN THE TWO PARTS. 
THE TOTAL ‘TIME OF THE TEST WILL BE ABOUT 60 MINUTES. 

WHEN TH SIGNAL IS GIVEN, PLEASE PICK UP THE WHITE PRINCESS® 
TELEPHONE HANDSET IN FRONT OF YOU AND HOLD IT TO YOUR TELE- 
PHONE LISTENING EAR. FOR BACH TEST CONDITION, YOU WILL HEAR 
THREE SENTENCES. AT THE END OF THE SENTENCES, A GREEN LIGHT 
ON THE KEYBOARD IN FRONT OF YOU WILI BE LIGHTED. DURING THE 
TIME THE GREEN LIGHT IS LIGHTED, YOU ARE TO RATE THE TRANS- 
MISSION QUALITY OF THE TEST CONDITION YOU JUST HEARD BY PUSH- 
ING ONE OF THE FIVE MARKED BUTTONS ON THE KEYBOARD AS FOLLOWS: 
EXCELLENT, GOOD, FAIR, POOR, AND UNSATISFACTORY. 

PLEASE FILL OUT THE CARD IN FRONT OF YOU. 

ARE THERE ANY QUESTIONS? 


Fig. 6—Test instructions. 


these noise conditions allowed the codec results to be referenced or an- 
chored into the body of information that has been accumulated over the 
years on the effects of random noise. It is estimated that the background 
noise contribution frem the original source tape was approximately 4 
dBrnC measured at **e line terminals of the telephone set. 

Speech-correlated noise conditions were included because they ap- 
proximate 1255 PCI quantizing noise and can be utilized in future 
modeling efforts. These conditions were produced by a device called the 
Modulated Noise Reference Unit (MNRU).? This introduces a noise 
signal to the input speech, which is directly correlated to the instanta- 
neous amplitude of the speech. The speech-correlated noise conditions 
were designated Q = 5, Q = 10, etc., where Q is equal to the decibel value 
of the ratio of the speech power to speech correlated noise power. 

In addition to the reasons given above, the analog noise and Q con- 
ditions served as control conditions by being included in every test ses- 
sion as a check on session-to-session differences. 
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4.5 Analysis of results 


The votes from each test condition are combined into a vote histogram 
with the votes for male and female speech pooled together. These his- 
tograms contain the number of votes recorded for each of the comment 
categories “excellent,” “good,” “fair,” “poor,” and “unsatisfactory” as 
represented by the category numbers 5, 4, 3, 2, and 1, respectively. A 
Mean Opinion Score (MOS) is calculated for each test condition by taking 
the arithmetic mean of the category numbers voted. A sample standard 
deviation (c) is also calculated for each vote histogram. 

The male and female results are combined because a simple regression 
analysis of the data indicated that there was little difference between 
the subjective responses for male and female speakers (about 0.2 to 0.4 
of a category point). The overall standard deviation of the data is on the 
order of 0.6 to 0.7 of a category point. 


V. SUBJECTIVE TESTING—SINGLE ENCODINGS 


The first round of subjective testing involves single encodings of 4255 
PCM, NIC, ADPCM, and SLC ADM as a function of three parameters; line 
bit rate, input level and received volume (listening level) pairs, and line 
error rate. . 


5.1 Test design 


Table IIA lists the test conditions as a function of line bit rate. Three 


Table I[A—Single encoding versus line bit rate conditions 


Algorithm Bit Rate (kb/s) 
Continuous Law PCM (mid-tread bias) 16,40,64 
15-Segment PCM (mid-tread bias) 32,40,48,64 
15-Segment PCM (mid-riser bias) 32,48 
NIC PCM 19,35,43,51,59 
ADPCM (with step-size leak) 16,24,32,40,48 
ADPCM (without step-size leak) 16,24,32,48 
SLC™ ADM 24,37.7, 48 


Table IiB—Single encoding versus level variation conditions 


Algorithm Bit Rate (kb/s) 
15-Segment PCM (mid-tread bias) 48 
NIC PCM 51 
ADPCM (with step-size leak) 48 
SLC™ADM 48 

Input Level (vu) Received Volumes (VU) 
—0.2 —20.7,—29.0 
—10.5 —20.7,—29.0,—38.3 
—20.7 —29.0,—38.3 
—31.0 —38.3 
—41.2 —47.0 
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Table IlIC-——Single encoding versus error rate conditions 


Algorithm Bit Rate (kb/s) 
15-Segment PCM (mid-tread bias) 48 
NIC PCM 51 
ADPCM (with step-size leak) 48 
SLC™ ADM 48 


versions of the 4255 PCM algorithm are included mainly for verifying that 
the subjective differences between continuous law, 15-segment mid- 
tread, and the 15-segment mid-riser algorithms are negligible. ADPCM 
is simulated with and without step-size leak. To economize on the total 
number of test conditions, various line bit rates are excluded with care 
taken so that subjects are exposed to a wide subjective quality range and 
the subjective ratings of these exclusions can be estimated by interpo- 
lation. The input speech level for these conditions is —24.8 VU (~ —23.4 
dBm), the average reported by McAdoo!?® for local calls over the Bell 
System message network. The measured idle channel noise of each codec 
is given in Table III. The received volume presented to the subject is 
—29 VU, measured at the line terminals of the telephone set. 

The conditions for single encoding as a function of input level and 
received volume are arrived at by estimating Bell System speech volumes 
and network losses for five types of network connections: (i) intra- 
building, (ii) interbuilding over a direct trunk, (vii) interbuilding over 
two tandem trunks, (iv) and (v) long and short connections over the 
intertoll network. For each of these five situations, the encoder and de- 
coder of a codec are postulated to be located in the telephone set, end 
office, and at an intermediate point in the loop as in the example of a 
remote switching or pair gain system. Speech volume!? and loss!! dis- 
tributions are used to derive the input level and received volume ranges 
to be tested. A condition is then defined by quantizing these ranges and 
assigning input level-received volume pairs to a codec where the received 
volume is always less than the input level. The four codecs are imple- 
mented at a single bit rate which is equal to 48 kb/s or as close to 48 kb/s 
as possible (51 kb/s in the case of NIC). This particular line bit rate is 
chosen for two reasons: (1) the large number of possible test conditions 
to be evaluated dictated the use of a single line bit rate, and (iz) that line 
bit rate should be 48 kb/s since experience has shown that the distortion 
introduced by a single encoding of 4255 PCM at 48 kb/s can be barely 
perceived and it is desirable to ascertain and compare the performance 
of the other codecs at a comparable line bit rate. Table IIB lists the 
codecs and the input level-received volume matrix used in this section 
of testing. 

The third and final portion of the single encoding tests is concerned 
with the subjective effect of random transmission errors. Independent 
errors are introduced between the encoder and decoder at rates 107}, 
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10-2, 10-8, 10-4, and 10-5 errors/bit. The four codecs are chosen to op- 
erate at a line bit rate equal to or nearly 48 kb/s for the same reasons 
given in the previous paragraph. The input level is set at —12.4 VU so that 
the entire dynamic range of the PCM and NIC codecs is exercised without 


overloading. A summary of the error rate test conditions appears in 
Table IIC. 


5.2 Results and observations 


The combined results for single encodings as a function of line bit rate, 
input level and received volume, and line error rate are given in the ap- 
pendix, Tables V, VI, and VII, respectively. Most of these tabulated 
results are summarized in Figs. 7 through 10. 

Figure 8 is a plot of the mean opinion score (MOS) versus line bit 
rate for the codecs of Table IIA. It is recalled that these results are ob- 
tained using an input speech level of —24.8 VU with a received volume 
of —29 VU, representing a 4.2-dB loss located after the decoder. An im- 
mediate observation that can be drawn is the subjective advantage of 
all the adaptive codecs (NIC, SLC ADM, and ADPCM) over the 1255 PCM. 
Below a line bit rate of 48 kb/s, this advantage is on the order of 12 to 16 
kb/s for equivalent subjective ratings. At 48 kb/s and above, the leveling 
off of the response curves is due to two effects. The first important factor 
is the absolute level of idle channel noise the subjects hear. The idle 


Table !II—Single encoding idle channel noise measurements 
Bit Rate Idle Channel Noise 

Algorithm (kb/s) (dBrnC) 
True Logarithmic PCM (mid-tread) 16 16.0 
True Logarithmic PCM (mid-tread) 40 16.0 

True Logarithmic PCM (mid-tread) 64 16.0 . 
15-Segment PCM (mid-riser) 32 37.7 
15-Segment PCM (mid-riser) 48 26.3 
15-Segment PCM (mid-tread) 32 16.0 
15-Segment PCM (mid-tread) 40 16.0 
15-Segment PCM (mid-tread) 48 16.0 
15-Segment PCM (mid-tread) 64 16.0 
NIC PCM 19 31.4 
NIC PCM 35 19.6 
NIC PCM 43 16.0 
NIC PCM 51 16.0 
NIC PCM 59 16.0 
ADPCM (with step-size leak) 16 27.1 
ADPCM (with step-size leak) 24 23.1 
ADPCM (with step-size leak) 32 18.6 
ADPCM (with step-size leak) 40 18.4 
ADPCM (with step-size leak) 48 18.4 
ADPCM (without step-size leak) 16 25.2 
ADPCM (without step-size leak) 24 22.3 
ADPCM (without step-size leak) 32 18.7 
ADPCM (without step-size leak) 48 18.4 
C ADM 24 23.3 
SLC ADM 37.7 15.5 
SLC ADM 48 13.3 
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Fig. 8—Subjective results—single encodings vs line bit rate. 


channel noises of the higher bit rate codec conditions listed in Table III 
are in the range of 13.3 to 26.3 dBrnC. The 4.2-dB loss following the 
decoders translates this range into 9.1 to 22.1 dBrnC at the line terminals 
of the telephone set. As these tests were conducted in an acoustically 
shielded room, the noise results of Fig. 7 show that subjects can react to 
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Fig. 10—Subjective results—line error rates. 


noise levels on the order of 10 dBrnC, since the curves do not level off 
in this vicinity. Thus, the leveling-off tendency in Fig. 8 is in line with 
the noise response curves of Fig. 7 for the lower values of noise. A second 
factor may be the distortion and bandlimiting introduced by the trans- 
mitting and receiving 500-type telephone set pair. 

Two other important observations can be extracted from Fig. 8. The 
first is that there appears to be no appreciable differences among the 
three versions of 4255 PCM tested, at least for bit rates greater than 
32 kb/s. The other observation is that the introduction of step-size leak 
in the ADPCM algorithm (see Section 2.3) has a small effect on subjective 
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quality in the vicinity of 32 kb/s under these conditions. The intent here 
is not to determine the optimal amount of leak to be introduced but to 
demonstrate that leak can affect subjective quality. 

The results for the subjective ratings as a function of input level and 
received volume are shown in Fig. 9. The results are grouped according 
to a fixed received volume with a varying input level. The solid lines 
represent the preferred received volume of —29 VU with the dotted and 
dashed lines representing received volumes of —20.7 VU and —38.3 VU, 
respectively. The cluster of four points on the left-hand side of Fig. 9 
represents the —47.0 VU received volume where only one corresponding 
input level of —41.2 VU is used. Each curve is a response for a particular 
codec and is labeled accordingly. 

The MOS results in Fig. 9 are due to a combination of effects: 
(t) quantizing distortion, (ii) the received volume level that the subject 
hears at the telephone set, (117) the absolute level of idle channel noise 
at the set terminals which is a function of both the input level and receive 
volume, and (iv) overload distortion which is manifested as ampli- 
tude-limiting for PCM and NIC and slope overload for the ADPCM and 
SLC ADM codecs. All four of these effects must be considered collectively 
when interpreting these plots. With these caveats in mind, a few obser- 
vations can be made here with detailed analyses left to future studies. 

The NIC algorithm is rated significantly better than the PCM algorithm 
for nearly all input levels and received volumes. This is an expected re- 
sult, since the NIC codec readjusts its dynamic range for each block of 
eight samples. The amplitude overload point of both the PCM and NIC 
codecs for the speech sources in this study occurs at an input level of 
approximately —12.4 vU. Thus, the subjective ratings of PCM and NIC 
are the greatest for the input level of —10.5 VU, where only a small 
number of samples are clipped and the entire dynamic range is fully 
exercised. At the input level of —0.2 vu, the peaks of the speech are 
roughly 12 dB above overload and the MOS ratings fall off for PCM and 
NIC at both received volumes of —20.7 and —29 VU. 

The SLC ADM and ADPCM codecs hold up somewhat better for high 
input levels of speech, confirming that slope overload is “more” tolerable 
than amplitude overload. The ADPCM and SLC ADM codecs are rated 
comparably for the input levels of —10.5 and —20.7 vu. 

Two additional general observations can be inferred from Fig. 9. First, 
the set of curves for the received volume of —29 VU tends to have higher 
MOS ratings than the sets for received volumes of —20.7 and —38.3 VU. 
This can be attributed to three causes: (i) —29 VU is the preferred re- 
ceived volume, (ii) the dynamic ranges of the PCM and NIC codecs are 
exercised fully with essentially no overloading, and (iii) the idle channel 
noises heard by the subjects are low because of the high input levels 
(0 to 4 dBrnC for the input level of —10.5 vU). These effects lead to 
higher MOS values than those shown for the same codecs in Fig. 8. The 
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second observation from Fig. 9 is that for the lowest received volume 
tested, —47.0 VU, the adaptive codecs are scored equivalently with PCM 
receiving an even lower rating due to its coarse quantization of the speech 
at the input level of —41.2 VU. 

The final portion of the single encoding testing 1s concerned with the 
effect of line error rates on the four codecs in Table IIC. The results are 
tabulated in Table VII and shown in Fig. 10, where MOS is plotted versus 
the errors per bit on a logarithmic scale. Note that the response curves 
converge at low (107°) and high (107!) error rates. The 107! rate is suf- 
ficiently severe so that the four codecs are all rated “unsatisfactory” and 
the 10~° error rate is virtually undetectable, and the four codecs are rated 
“good.” The leveling-off of the four curves for low error rates at MOS 
values of 4.2 to 4.5 is slightly higher than the asymptotic value in Fig. 
8. This is due to the fact that the input level of —12.4 VU and the corre- 
sponding receive value of —29 VU result in lower idle circuit noises as 
measured at the line terminals of the telephone set. 

Significant differences among the codecs are only manifested in the 
area between 10~4 and 107? errors per bit. The SLC codec is the least 
sensitive to line errors while PCM is the most sensitive. NIC and ADPCM 
(with step-size leak) are rated nearly the same and fall between the SLC 
ADM and PCM extremes. Although the ADPCM codec without step-size 
leak was not formally tested in the presence of errors, informal listening 
tests have shown that it is subjectively comparable to the PCM codec. 


VI. SUBJECTIVE TESTING—TANDEM ENCODINGS 
6.1 Test design 


This section contains a description of the tandem encoding conditions 
listed in Table IV. Basically, the four codec algorithms of the previous 
section are used here at a few selected bit rates, again because of the 
constraint of economizing on the total number of test conditions. Evi- 
dence from the single encoding versus line bit rate tests show that the 
adaptive codecs, NIC, ADPCM, and SLC ADM have a 12- to 16-kb/s ad- 
vantage over “255 PCM for bit rates below 48 kb/s. Thus, it was decided 
to compare 48-kb/s 4255 PCM against NIC, ADPCM, and SLC ADM in the 
vicinity of 32 kb/s. The 8-kHz sampling rate dictated that ADPCM and 
NIC operate at 32 and 35 kb/s, respectively. Since the SLC-40 ADM al- 
gorithm is implemented in the loop plant today, the sampling rate of that 
system, 37.7 kb/s, is used. Finally, there is interest in the tandem per- 
formance of D channel banks, hence 64-kb/s 1255 PCM is included asa 
reference. ADPCM is implemented without step-size leak since subjects 
did perceive some degradation for ADPCM with leak at 32 kb/s. 

The input speech level to the codecs is —20 VU, approximately the 
speech volume averaged over Bell System local and toll connections.!° 
The received volume is set at the preferred level of —29 VU. A tabulation 
of the tandem encoding conditions is given in Table IV. 
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Table |V—Tandem encoding conditions 


Bit Rate Number of 
Codec (kb/s) Tandem Encodings 
PCM 64 1,2,4,6,8 
PCM 48 1,2,4,8 
NIC PCM 35 12'4'8 
ADPCM 32 1,2,4,8 
SLC ™ADM 37.7 1,2,4,8 


To explain the tandem encoding process in more detail, two configu- 
rations are shown in Fig. 11 which are simply expansions of Fig. 2. The 
upper arrangement is applicable to the 8-kHz codecs, PCM, NIC, and 
ADPCM. The filtering used for bandlimiting and reconstruction is re- 
alized with transmit and receive filters with characteristics that are 
similar to those used in D3 channel banks.!? Note that each time the 
tandem encoding loop is transversed, the filters encountered are a 
receive/transmit pair, analogous to a back-to-back D channel bank sit- 
uation. 

Tandem encodings of the 37.7-kb/s SLC-40 ADM are generated ina 
slightly different manner, as shown in the lower portion of Fig. 11. 
Specifically, the differences are manifested in the filtering. The SLC 
input and output filters, which are implemented in software, in con- 
junction with a 6.4-kHz low-pass filter provide sufficient rejection at half 
the 37.7-kHz sampling rate to avoid aliasing. After the desired number 
of tandem encodings have been simulated, a D channel bank transmit 
filter is inserted in the playback path prior to recording so that small 
bandwidth differences between the 8-kHz and 37.7-kHz SLC ADM 
conditions are eliminated. | 

The use of these two filtering strategies results in different in-band 
characteristics as illustrated in Figs. 12 and 13, where the overall re- 
sponses for a single encoding and eight tandem encodings are given for 
the 8-kHz and the 37.7-kHz SLC ADM codecs, respectively. 


6.2 Results and observations 


The tandem encoding MOS results and the breakdown according to 
comment category are tabulated in Table VIII of the appendix. 

These results are shown in Fig. 14, where the MOS ratings are plotted 
as a function of the number of tandem encodings. An immediate obser- 
vation is the superiority of the 64-kb/s 4255 PCM over the other four 
codecs. This superiority is evidenced by the facts that the response curve 
for 64-kb/s PCM begins to fall off at four encodings while the responses 
for the other four codecs fall off at two encodings, and eight encodings 
of 64-kb/s PCM is rated slightly below a MOS of 4 (“good”), while eight 
encodings of the other codecs are rated between 3 (“fair”) and 2 (“‘poor’’). 
Even for eight encodings of 64-kb/s 4255 PCM, the degradation is not 
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Fig. 12—Overall frequency response—8 kHz—one and eight encodings. 


attributable only to quantizing noise. Accumulation of the idle channel 
noise and bandwidth reduction of the tandem filtering are other fac- 
tors. 

Another general observation is that, to a rough approximation, the 
three adaptive codecs (35-kb/s NIC, 32-kb/s ADPCM, and 37.7-kb/s SLC 
ADM) behave in a similar fashion to 48-kb/s PCM. This result concurs 
with the single encoding results of Fig. 8 where it was shown that the 
adaptive codecs exhibited a 12- to 16-kb/s subjective advantage over 
PCM. 

Finally, all the curves in Fig. 14 at one encoding agree well with the 
single encoding results of Fig. 8 after it is recognized that the input level 
here is 4.8 dB greater than that of the single encoding tests. Both the 
single encoding versus line bit rate and tandem encoding tests were 
conducted at the received volume of —29 vu. Thus, the higher input level 
in the tandem encoding tests results in a 4.8-dB reduction in the idle 
channel noises heard by the subjects. 
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Fig. 13—Overall frequency response—37.7 kHz—one and eight encodings. 


Vil. SUBJECTIVE TESTING—REFERENCE CONNECTIONS 


The purpose of the reference connection conditions is to evaluate the 
codecs of the previous section in representative network connections. 

Basically, these reference connections are of three types: local, exchange, 
and toll. The codecs are placed in connections where the environment 
in terms of speech volume, analog loss, and analog noise is defined from 
survey data. The received volumes are not held constant, but vary de- 
pending on the loss in the connection. For all connections, the codec 
characteristics such as overload and idle channel noise are defined in 
Section II. The connections are chosen in such a manner that average 
and worst-case situations are represented. 

Many reference connections involve similar and dissimilar tandem 
encodings. Prior to a detailed description of the different connections, 
a few words on the analog interface between successive encodings are 
appropriate. The four codecs can be grouped into two categories deter- 
mined by whether the sampling rate is 8 kHz or 37.7 kHz. This leads to 
four possible combinations in simulating a tandem encoding because 


SELECTED DIGITAL CODECS FOR SPEECH 3141 


INPUT LEVEL = —20 VU 
RECEIVE VOLUME = —29 VU 


—O 64 kb/sec 1255 PCM 
O_O S48 kb/sec 1255 PCM 
&-— 2 35 kb/sec NIC 
O——_——T:SO37.7 kb/sec SLC™ ADM 
S232 kb/sec ADPCM (WITHOUT STEP SIZE LEAK) 


MEAN OPINION SCORE 





1 2 4 6 8 
NUMBER OF TANDEM ENCODINGS 


Fig. 14—Subjective results—tandem encodings. 


of the filtering that is necessary. These four configurations are shown 
in Fig. 15. The top and bottom are identical to those shown in Fig. 11 
since the sampling rates of the two codecs are the same. The two center 
arrangements in Fig. 15 are the cases where an 8-kHz and 37.7-kHz codec 
are both involved. Here the D channel bank transmit and receive filters 
are utilized with the SLC input and output filters. These four arrange- 
ments apply to the remainder of the discussions on reference connec- 
tions. 


7.1 Test design 
7.1.1 Local reference connections 


The local connection conditions are configured as shown in Fig. 16. 
There are four types of connections, depending on the presence of codecs 
in the loops: (1) near- and far-end loops, both analog, (ii) codec in near- 
end loop, (iii) codec in far-end loop, and (iv) codec in both loops. Note 
that an average loop loss of 3.7 dB! is assumed regardless of the presence 
or absence of a codec. It is recognized that this assumption may be in- 
appropriate when a codec is present, since the loop loss might be reduced. 
However, this loss is fixed to avoid confounding the subjective effect of 
loss variation with that of introducing a codec in the loop. The input 
speech volume is chosen to be the average found for local calls, 
—24.8 vU.!° The assumed loop loss of 3.7 dB translates this into a re- 
ceived volume of about —28.5 VU. 

The central office is modeled as either an analog switch or a 64-kb/s 
255 PCM digital switch. When the office is analog, 16 dBrnC of random 
noise is added to the speech as it traverses the office. This noise value 
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Fig. 15—Dissimilar tandem encoding configuration. 


is chosen to correspond to the digital switch where 16 dBrnC is added 
after the PCM decoder, as described in Section 2.1. In the context of this 
local environment, the local reference connection conditions to be sub- 
jectively evaluated are listed in Fig. 21. In the conditions with a codec 
in both loops, the two codecs may be either identical or dissimilar. For 
the case of two dissimilar codecs, the subjective effect of ordering is in- 
vestigated by including conditions where the physical locations of the 
codecs are interchanged. 


7.1.2 Exchange reference connections 


The exchange reference connection model is shown in Fig. 17. This 
model is a simple extension of the local connection model in that an 
additional central office is connected to the first office via a direct trunk. 
The central offices and loops, whether analog or digital, are modeled as 
in the local reference connections. 
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Fig. 16—Local reference connections. 


The direct trunk facility between the central offices is either analog 
or digital. The loss used in both cases is 3 dB. In addition, 14.8 dBrnC 
of noise derived from survey data is introduced on the analog trunk. On 
the digital direct trunk, the noise introduced is the characteristic idle 
channel noise of the codec. The speech volume in both situations is 
—23.1 vu,!° slightly higher than that found on the local reference con- 
nections. 

To keep the number of conditions down to a mangeable level, codecs 
are introduced into the exchange connection in only one manner. If a 
codec is to appear in the connection, it must appear in all three compo- 
nents of connection simultaneously—both loops and the direct trunk. 
Furthermore, the three codecs must be identical; a mixture of different 
codecs is not allowed. Using these rules, the exchange reference con- 
nection conditions are formulated and tabulated in Fig. 22. 

When both central offices are digital (64-kb/s 4255 PCM) and the co- 
decs in the loops and direct trunk are either PCM or NIC, only a trivial 
code conversion is needed between the office and loop or trunk. The 3-dB 
direct trunk loss can be included in the far-end loop loss, and the entire 
connection collapses into a single encoding. For the case of the three 
ADPCM codecs, the analog link between the codec and digital central 
office can also be eliminated. All that is necessary is an intermediate D/D 
conversion to uniform PCM. Thus, the entire connection is digital where 
the 3-dB loss is a digital loss and is introduced where it is depicted.in Fig. 
ie 
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Fig. 17—Exchange reference connections. 









7.1.3 Toll reference connections 


The toll reference connections are arrived at using the diagram in 
Fig. 18. Essentially, this diagram differs from Fig. 17 in that the direct 
trunks are replaced with the toll network wherein a connection comprises 
two toll-connecting trunks and one or more intertoll trunks. The toll 
network is assumed to be implemented on analog and digital facilities 
where “digital” implies only 64-kb/s 1255 PCM-No. 4 ESS switches, VIF 
terminals, and D channel banks.! The codecs described in the previous 
two sections will only be modeled in loops on the ends of the toll con- 
nection. Thus, a toll reference connection consists of two loops where 
a codec may appear and toll trunks with various mixtures of analog and 
digital 64-kb/s 4255 PCM facilities. 

The local portion of Fig. 18 is similar to that in the local and exchange 
reference connections with the following exceptions. To limit the total 
number of test conditions, only the analog version of the central office 
is configured in toll connections. As before, a 16-dBrnC noise source is 
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Fig. 18—Toll connection diagram. 


3146 THE BELL SYSTEM TECHNICAL JOURNAL, NOVEMBER 1978 


incorporated in the central office model. The average speech volume used 
in the toll reference connections is significantly higher than that found 
in local and exchange connections and was found by McAdoo to be 
—16.8 vu.!° Finally, when two codecs are simultaneously introduced in 
both loops on a particular connection, both codecs are always identi- 
cal. 

Four types of toll connections are considered here: (7) short, (iz) long, 
(iit) “worst-case” long, and (iv) all-digital. The short and long toll con- 
nections are further divided into three subcategories according to facility 
makeup: (a) analog switches and analog transmission facilities, (6) analog 
switches and digital transmission facilities, and (c) digital switches and 
analog transmission facilities. The “worst-case” long connection is simply 
derived from the long toll connection by the introduction of additional 
intertoll trunks. The all-digital connection consists of a single 64-kb/s 
4255 PCM encoding in the toll network with the speech in digital form 
between the transmit and receive central offices. 

The toll reference connections are now described, using Fig. 19. 

(t) Short Toll Connections—The analog connection S consists of 
two toll connecting trunks and a single short intertoll trunk. The toll 
connecting trunks are characterized by a VNL loss of 3 dB and an average 
survey noise source of 11.3 dBrnC.14 The 60-mile intertoll trunk has a 
VNL design loss of 0.5 dB and a projected noise of 20.4 dBrnC.!! The 
analog toll switches are assumed to add noise in an amount equivalent 
to 12 dBrnC. These characterizations of analog transmission and 
switching facilities will apply to all the toll connections discussed from 
this point on. 

The center configuration, Sg, is a modification of S; where the analog 
toll switches are replaced with No. 4 ESS switches and VIF terminals. The 
VIF terminal involves a 64-kb/s 1255. PCM encoding and decoding fol- 
lowed by a 16-dBrnC0O noise source as described in Section 2.1. This 
translates into 13 dBrnC at the —3 TLP point for the speech with the 
codec overload set at +3 dBm0. 

The bottom configuration, S3, is another modification of S; with digital 
transmission facilities and analog switches. All three trunks are analog 
trunks, and VNL loss design applies. However, the noise sources on the 
toll-connecting and intertoll trunks have been replaced with the ap- 
propriate idle channel noises at the decoder side of the D channel 
banks. . 

(it) Long Toll Connections—Referring to configuration L; in the 
diagram, the toll-connecting portion is identical to that of the short 
connection. The intertoll section consists of two intertoll trunks, a short 
(48 mi.) and a long (1300 mi.) one. The short trunk has loss and noise 
characteristics similar to the intertoll trunk in the short connection 
described above. Using VNL design and survey data, the long intertoll 
trunk has a loss of 2.0 dB and a noise of 37.8 dBrnC. 
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Fig. 19—Toll reference connections. (a) Short toll connections. (Figs 19b and 19c on 
following pages.) 


Configuration Ly; is modified in a manner identical to that described 
above for the short toll connection to produce the mixed analog and 
digital connections, Le, and Ls. Of significant interest here is the re- 
duction of noise with the deployment of digital transmission facilities 
in configuration L3. It is expected that this reduction in noise will 
manifest itself subjectively. 

(tit) Worst-Case Long Toll Connection—This condition is dubbed 
“worst case” for two reasons. First, it is constructed from Lg of the long 
toll connections by the addition of three intertoll trunks for a total of 
five intertoll trunks. Network statistics show that a small percentage of 
toll connections are made over five intertoll trunks. Second, the No. 4 
ESS with analog transmission facilities type of connection contains not 
only all the analog impairments such as loss and noise but also the 
64-kb/s 4255 PCM encodings. In this connection, there are a total of six 
PCM encodings in the toll portion and also codecs in the loops. 

(iv) All-Digital Toll Connection—This connection is the simplest 
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Fig. 19(b)—Long toll connections. 
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of the toll connections in that the toll network (switches and trunks) is 
purely digital. Hence, the toll network can be modeled by a single 64-kb/s 
255 PCM encoding implemented on a D channel bank on each toll- 
connecting trunk and a fixed 6-dB loss at the receiving central office. 

The 4255 PCM at 48 and 64 kb/s, 35-kb/s NIC, 32-kb/s ADPCM, and 
37.7-kb/s SLC ADM codecs are incorporated in the loops on the eight 
toll connections described above. 


7.2 Results and observations 


Detailed tabulations of the MOS ratings for the local, exchange, and 
toll reference connection conditions are given in Tables IX through XII 
in the appendix. A cumulative comparison of the analog noise ratings 
is plotted in Fig. 20, where the single encoding noise results are plotted 
along with the tandem encoding and reference connection results to il- 
lustrate a comparison across all the blocks of testing. 
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Fig. 20—Subjective results—additive random noise. 


7.2.1 Local reference connections 


The local reference connection results are summarized in Fig. 21. In 
this figure, the connections of Fig. 16 are redrawn to aid the reader in 
associating the connection with the results. The top configuration of Fig. 
21 is the analog version of the connection where the only impairment 
introduced in the speech is the noise of the analog central office. This 
amounts to 12.3 dBrnC of noise with a speech level of —28.5 VU at the 
telephone set terminals. This condition is rated with a MOS of 4.0 and 
is in agreement with the noise results shown in Fig. 20. 

The second configuration represents the digital central office case 
using 64-kb/s 4255 PCM. As described in Section 2.1, 16 dBrnC of noise 
is introduced into the speech following the decoder so that the speech 
level and idle channel noise at the line terminals of the telephone set are 
identical to those of the analog connection above it. The MOS of this 
connection is 4.0 and it is concluded that the 64-kb/s PCM office intro- 
duces no additional subjective distortion. 

The bottom configuration in Fig. 21 is used to represent all possible 
combinations of codecs in the loops. The table directly beneath it gives 
the MOS ratings for three cases: (7) codec in the transmit loop with an 
analog receive loop, (ii) codec in the receive loop with an analog transmit 
loop, and (iit) codecs in both loops. The first two columns are essentially 
single encodings with the addition of the 16-dBrnC office noise. These 
results are in agreement with the single encoding results of Section 5.2. 
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Fig. 21—Subjective results—local reference connections. 


48 kb/sec 





The third column represents two codecs in tandem with the addition of 
the office noise. It is observed that a second 64-kb/s 4255 PCM encoding 
does not introduce any additional degradation, while the MOS ratings 
for the other four codecs are slightly lower than those for the case of a 
single codec in one loop of the connection. This result is in agreement 
with the tandem encoding results discussed in Section 6.2. . 

The bottom table in Fig. 21 is a matrix of the results for mixed tandem 
encodings, that is, there are codecs in both loops but they are dissimilar. 
Reversal of the ordering of any pair of codecs is indicated by inter- 
changing the row and column indices for any element in the matrix. A 
comparison of the elements in the upper and lower triangular portions 
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of the matrix leads to the following conclusions: (i) the subjective per- 
formance of a tandem encoding involving 64-kb/s PCM and one of the 
four lower bit rate codecs is roughly equivalent to a single encoding of 
the lower bit rate codec, and (it) the subjective effects of ordering in 
dissimilar tandem encodings are small for the five codecs discussed 
here. 


7.2.2 Exchange reference connections 


The exchange reference connection results are shown in Fig. 22. In 
keeping with the format of the local reference connections, the top 
configuration in Fig. 22 represents the all-analog exchange connection. 
Here, 19.5 dBrnC of noise appears with the —29.8 VU speech at the 
telephone set terminals and is rated with a MOS of 3.6 by the subjects. 

The second connection in Fig. 22 is identical to the analog connection 
except that three identical codecs are introduced in the exchange trunk 
and both loops. Note that the 14.8-dBrnC noise on the analog exchange 
trunk is effectively replaced by the idle channel noise of the codec. The 
table immediately below this connection lists the MOS results for the five 
codecs. The introduction of the three 64-kb/s PCM codecs does not alter 
the MOS rating over that of the analog connection. However, three en- 
codings of one of the other four codecs degrades the connection some- 
what. 

The last configuration in Fig. 22 is a modification of the second con- 
figuration which is realized by replacing both analog central offices with 
digital central offices. As explained in Section 7.1, the resulting con- 
nection can be represented as either a single encoding, a series of syn- 
chronous tandem encodings with intermediate D/D conversions, or a 
series of asynchronous encodings with intermediate analog links, de- 
pending on code compatibility between the 64-kb/s PCM central offices 
and the codecs on the loops and exchange trunk. For 64-kb/s PCM, 48- 
kb/s PCM, and 35-kb/s NIC, the connection from the encoder of the 
transmit codec to the decoder of the receive codec collapses into a single 
encoding. The 3-dB exchange trunk loss is incorporated in the loop loss 
following the decoder. Thus, the first three entries in the table under this 
connection represent single encodings of these codecs. Consequently, 
the reduction of noise over the all-analog connection results in MOS 
ratings higher than the analog connection rating of 3.6. In the case of 
ADPCM, the successive encodings are performed synchronously with 
a corresponding reduction in noise, and the connection is rated nearly 
equivalent to the analog connection. The final entry for the SLC ADM 
codec is virtually identical to the analog central office case because all © 
the noise sources are unchanged and the tandem encodings are per- 
formed asynchronously with the analog central offices replaced by 
64-kb/s PCM digital central offices. 
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Fig. 22—Subjective results—exchange reference connections. 


7.2.3 Toll reference connections 


The final portion of the subjective testing is concerned with the effects 
of codecs in toll reference connections. The codecs are placed in loops 
on the ends of the eight toll connections of Fig. 19. This is accomplished 
in two steps. The first step is the placing of a codec in the receive loop 
only and, in the second step, identical codecs are placed in both loops. 

Figure 23 displays the results for a codec in the receive loop only. The 
results for the longer toll connections on which 37.8-dBrnC of noise 
appears demonstrate that the noise is the dominant impairment and the 
effects of the codecs are of little importance (see connections Ly, Le, and 
W). The MOS ratings of these connections are tightly grouped in the area 
of a MOS of 3 (“fair’’), indicating that the subjects are reacting only to 
the noise source. Note that the noises on the intertoll trunks translate 
to approximately 31 dBrnC of noise at the line terminals of the telephone 
set for these three connections. A check with the noise results of Fig. 20 
indicates agreement. It is also observed that in toll connection Ls, the 
37.8-dBrnC noise is eliminated because of the deployment of digital 
transmission systems on the intertoll trunks. This shifts the MOS ratings 
upward in line with those obtained for the short (Sj, Se, S3) and all-digital 
(D) connections. 

The toll connections which received the higher subjective scores (Sj, 
So, S3, Ls, and D) show that, if the quality of the connection is good, 
subjects can discriminate among the five codecs and the MOS ratings are 
spread out over a point or so. However, it is observed that, even though 
the discrimination between one codec and the next may be small, on most 
connections the 64-kb/s PCM codec case is rated nearly equivalent to the 
case where both loops are analog. Also, a comparison of connections S; 
versus Sg and L, versus Ly with both loops analog serves to demonstrate 
that the addition of 64-kb/s PCM codecs in the analog toll portion of the 
connection does not significantly alter the subjective performance of the 
connection. 








x ANALOG LOOPS 

@ 64 kb/sec PCM 
21 O 48 kb/sec PCM 

A 35 kb/sec NIC 

O 32 kb/sec ADPCM 

+ 37.7 kb/sec SLC™ 


1 Nl _t | Nl ea ait. Si 


MEAN OPINION SCORE 


= 





S; Sy S i * L, W D 


TOLL CONNECTIONS (CODEC IN RECEIVE LOOP ONLY) 


Fig. 23—Subjective results—toll reference connections. 
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The introduction of codecs in both loops of the toll connections results 
in the ratings of Fig. 24. Many of the observations for Fig. 23 apply here 
with minor modification. The toll connections of L;, Ly, and W are rated 
the same with codecs in both loops when compared to the results for a 
codec in the receive loop only. In nearly all of the connections, the 64-kb/s 
PCM and analog loop cases again receive nearly equivalent ratings. The 
only differences between the results of Figs. 23 and 24 are those for 
connections with the higher ratings (S,, Se, S3, L3, and D). Here the 
spread among the MOS results for the lower bit rate codecs has increased 
because codecs are introduced in both loops. This indicates that subjects 
can perceive an additional degradation for two encodings of the lower 
bit rate codecs over a single encoding, a result already demonstrated in 
the tandem encoding results of Section 6.2. 


Vill. CONCLUSIONS 


Several important conclusions can be drawn from the results presented 
in this paper for speech: 

(i) The 64-kb/s 1255 PCM codec can be used, with very few restric- 
tions, in the telephone network without affecting speech performance. 
It can be tandemed up to eight times without introducing serious sub- 
jective degradation. It can be inserted in a variety of analog network 
connections with essentially no subjective penalty. However, it is shown 
that eight encodings of either 48-kb/s 4255 PCM, 35-kb/s NIC, 32-kb/s 
ADPCM, or 37.7-kb/s SLC ADM introduce significant subjective degra- 
dations. 
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Fig. 24—Subjective results—toll reference connections. 
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This conclusion supports the general application of the 64-kb/s 1255 
PCM in the combined analog and digital network which is likely to exist 
for many years to come. The use of codecs with lower bit rates would 
require more stringent application rules to avoid excessive degradation 
in tandem arrangements. 

(ii) The 10-6 error rate minor alarm level for the toll switched digital 
network,! which uses 64-kb/s 1255 PCM, is more than adequate for speech 
since the results of this study show that the subjective degradation at 
a 10-5 error rate is negligible. In other words, if error rates could be 
guaranteed to never exceed 10~° errors/bit, then error rate would not 
be a consideration for the four 48-kb/s codecs tested. 

(tii) Connection degradation is dominated by high levels of random 
noise that may be found on long toll connections. However, the 64-kb/s 
4255 PCM all-digital toll connection shows that the network performance 
will improve as the network becomes increasingly digital. As the network 
evolves, the performance should not be unduly limited by the choice of 
codec deployed in the loops. 

(tv) For single encodings, the results show that the three adaptive 
coding schemes (NIC, ADPCM, and SLC ADM) have about a 12- to 16-kb/s 
advantage over 4255 PCM for equivalent subjective ratings. This ad- 
vantage can also be inferred from the tandem encoding and reference 
connection results where 48-kb/s PCM has approximately the same rating 
as 35-kb/s NIC, 32-kb/s ADPCM, and 37.7-kb/s SLC ADM. 

(v) At 48-kb/s, the level variation and error rate tests demonstrate 
that the three adaptive codecs are slightly superior to PCM under these 
conditions. 

(vi) For the three adaptive coding algorithms (NIC, ADPCM, and SLC 
ADM), no single algorithm is significantly superior to the other two on 
an overall basis. All three perform comparably, with small variations over 
the range of conditions tested. 
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APPENDIX 


Table V—Detailed tabulation of subjective scoring 
for single encoding vs line bit rate tests 


Bit Rate % % % % % 
Codec (kb/s) Votes Exc Good Fair Poor Uns. MOS o 
PCM 32 102 0.0 1.0 31.4 54.9 12.8 2.2 0.66 
PCM 40 102 2.0 20.6 56.9 18.6 2.0 3.0 0.74 
PCM 48 102 15.7 47.1 32.4 49 0.0 3.7 0.78 
PCM 64 101 35.6 49.5 14.9 0.0 0.0 4.2 0.68 
PCM} 32 102 0.0 2.0 30.4 62.8 49 2.3 0.59 
PCM1 48 101 4.0 43.6 42.6 9.9 0.0 3.4 0.72 
PCM2 16 102 0.0. 0.0 0.0 6.9 93.1 1.1 0.25 
PCM2 40 101 3.0 25.7 48.5 22.8 0.0 3.1 0.77 
PCM2 64 102 30.4 53.9 15.7 0.0 0.0 4.2 0.66 
NIC 19 100 0.0 1.0 27.0 49.0 23.0 2.1 0.73 
NIC 35 102 10.8 38.2 43.1 7.8 0.0 3.5 0.79 
NIC 43 101 22.8 58.4 18.8 0.0 0.0 4.0 0.64 
NIC 51 101 30.7 54.5 13.9 1.0 0.0 4.2 0.68 
NIC 59 101 33.7 53.5 10.9 2.0 0.0 4.2 0.70 
ADPCM 32 101 19.8 41.6 36.6 2.0 0.0 3.8 0.77 
ADPCM 48 102 31.4 50.0 17.6 1.0 0.0 4.1 0.72 
ADPCM} 16 101 0.0 0.0 12.9 61.4 25.7 1.9 0.61 
ADPCM} 24 102 1.0 9.8 51.0 38.2 0.0 2.7 0.67 
ADPCM] 32 102 3.9 37.3 51.0 6.7 1.0 3.4 0.71 
ADPCM} 40 101 19.8 48.5 26.7 5.0 0.0 3.8 0.80 
ADPCM] 48 102 30.4 55.9 12.8 1.0 0.0 4.2 0.67 
SLCO™ 24 102 2.0 20.6 57.8 18.6 1.0 3.0 0.71 
SLC 37.7 101 11.9 46.5 36.6 5.0 0.0 3.7 0.75 
SLC 48 102 29.4 51.0 15.7 3.9 0.0 4.1 0.78 
Noise 10 dBrnC 203 34.0 47.3 17.2 1.5 0.0 4.1 0.74 
Noise 20 dBrnC 204 9.8 30.9 51.5 7.8 0.0 3.4 0.77 
Noise 30 dBrnC 203 1.5 15.3 46.8 35.0 15 2.8 0.76 
- Noise 40 dBrnC 203 1.5 3.5 26.6 54.2 14.3 2.2 0.79 
Q—5 dB 203 0.0 0.0 0.0 32.0 68.0 1.3 0.47 
Q—10 dB 204 0.0 0.5 18.1 54.4 27.0 1.9 0.68 
Q—15 dB 204 0.0 10.3 48.5 35.8 5.4 2.6 0.74 
Q—20 dB 203 2.0 32.0 53.2 12.8 0.0 3.2 0.69 
Q—25 dB 203 23.2 50.3 25.1 1.5 0.0 4.0 0.73 
Notation: 


PCM = 15-segment PCM (mid-tread) 
PCM, = 15-segment PCM (mid-riser) 
PCM2 = continuous law PCM (mid-tread) 
NIC = NIC PCM 

ADPCM = ADPCM (without step-size leak) 
ADPCM, = ADPCM (with step-size leak) 
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Table VI—Detailed tabulation of subjective scoring 
for single encoding vs level variation tests 


% % % % % 

Codec Levels Votes Exc. Good Fair Poor Uns. MOS o 
PCM I,,Ri 51 5.9 814 41.2 19.6 2.0 3.2 0.89 
PCM ,,Re 51 5.9 25.5 52.9 15.7 0.0 3.2 0.77 
PCM I5,Ri 51 33.3 37.38 19.6 9.8 0.0 3.9 0.96 
PCM To,Re 51 45.1 48.1 11.8 0.0 0.0 4.3 0.68 
PCM L,R3 51 13.7 43.1 39.2 3.9 0.0 3.7 0.76 
PCM I3,Re 51 19.6 54.9 25.5 0.0 0.0 3.9 0.67 
PCM I3,R3 51 17.7 48.1 387.3 2.0 0.0 3.8 0.76 
PCM 14,R3 51 0.0 41.2 56.9 2.0 0.0 3.4 0.53 
PCM I5,R4 51 0.0 2.0 31.4 529 13.7 2.2 0.69 
NIC I,,Ri 51 15.7 49.0 31.4 3.9 0.0 3.8 0.76 
NIC ],,Re 51 5.9 25.5 47.1 21.6 0.0 3.2 0.838 
NIC To,Ri 50 34.0 38.0 24.0 4.0 0.0 4.0 0.86 
NIC Ip,Re 51 54.9 41.2 3.9 0.0 0.0 4.5 0.57 
NIC, Io,R3 51 27.5 47.1 25.5 0.0 0.0 4.0 0.73 
NIC I3,Re 51 27.5 70.6 2.0 0.0 0.0 4.3 0.48 
NIC I3,R3 51 35.38. 85.8 29.4 0.0 0.0 4.1 0.80 
NIC l,,R3 51 216 41.2 33.3 3.9 0.0 3.8 0.82 
NIC I5,R4 51 2.0 7.8 549 33.3 2.0 2.8 0.71 
ADPCM 1,,Ri 51 83.3 48.1 15.7 7.8 0.0 ° 4.0 0.90 
ADPCM I,,Re 50 18.0 46.0 28.0 8.0 0.0 3.7 0.84 
ADPCM To,R1 51 31.4 438.1 19.6 3.9 2.0 4.0 0.92 
ADPCM Io,Re 51 85.38 52.9 11.8 0.0 0.0 4.2 0.64 
ADPCM Io,R3 50 40 38.0. 48.0 10.0 0.0 3.4 0.71 
ADPCM I3,Re 50 34.0 40.0 14.0 12.0 0.0 4.0 0.98 
ADPCM I3,R3 51 11.8 41.2 39.2 7.8 0.0 3.6 0.80 
ADPCM I,,R3 51 15.7 45.1 39.2 0.0 0.0 3.8 0.70 
ADPCM I5,R4 51 0.0 15.7 49.0 31.4 3.9 2.8 0.76 
SLC™ T,,R1 51 17.7 37.8 35.8 9.8 0.0 3.6 0.88 
SLC ],,Re 50 16.0 36.0 42.0 6.0 0.0 3.6 0.82 
SLC In,R1 51 43.1 31.4 15.7 98 0.0 41 0.99 
SLC Ib,Re 51 216 549 23.5 0.0 0.0 4.0 0.67 
SLC I5,R3 51 19.6 47.1 29.4 3.9 0.0 3.8 0.78 
SLC Ts,Re 50 34.0 52.0 14.0 0.0 0.0 4.2 0.66 
SLC I3,R3 51 7.8 48.1 37.8 11.8 0.0 3.5 0.80 
SLC l,R3 51 3.9 7.8 60.8 27.5 0.0 2.9 0.70 
SLC I5,R4 50 0.0 4180 42.0 40.0 0.0 2.8 0.73 
Noise 10 dBrnC Ri 51 23.6 39.2 27.5 7.8 2.0 3.8 0.97 
Noise 10 dBrnC R3 51 5.9 60.8 25.5 7.8 0.0 3.7 0.71 
Noise 10 dBrnC R4 51 3.9 9.8 43.1 438.1 0.0 2.8 0.79 
Noise 20 dBrnC Ri 51 3.9 29.4 52.9 11.8 2.0 3.2 0.77 
Noise 20 dBrnC R3 51 17.7 54.9 27.5 0.0 0.0 3.9 0.66 
Noise 20 dBrnC Rg 51 0.0 5.9 47.1 47.1 0.0 2.6 0.60 
Noise 30 dBrnC Ri 51 3.9 11.8 43.1 37.3 3.9 2.8 0.86 
Noise 30 dBrnC R3 51 0.0 15.7 56.9 27.5 0.0 2.9 0.65 
Noise 30 dBrnC R4 51 2.0 2.00 25.5 64.7 §.9 2.3 0.69 
Noise 40 dBrnC Ri 51 0.0 5.9 25.5 41.2 27.5 2.1 0.87 
Noise 40 dBrnC R3 51 0.0 0.0 23.5 58.8 17.6 2.1 0.64 
Noise 40 dBrnC R4 51 2.0 2.0 11.8 66.7 17.6 2.0 0.74 
Q—5 dB Ri 51 0.0 0.0 5.9 19.6 74.5 1.3 0.58 
—5 dB R3 51 0.0 0.0 0.0 27.5 72.5 1.3 0.45 
Q—5 dB Rg 51 0.0 0.0 0.0 15.7 84.3 12 0.36 
Q—10dB Ry 51 0.0 2.0 9.8 49.0 39.2 1.8 0.71 
Q—10dB R3 51 0.0 0.0 11.8 # 62.7 25.56 19 0.59 
Q—10dB R4 51 0.0 0.0 2.0 54.9 43.1 16 0.53 
Q—15 dB Ri 51 9.8 7.8 651.0 29.4 2.0 2.9 0.92 
Q—15 dB R3 51 0.0 3.9 49.0 47.1 0.0 2.6 0.57 
Q—15 dB 4 51 0.0 5.9 7.8 70.6 15.7 2.0 0.68 
Q—20 dB Ri 51 15.7 35.38 41.2 7.8 0.0 3.6 0.84 
Q—20 dB R3 51 3.9 23.5 54.9 17.6 0.0 3.1 0.74 
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Table VI (cont) 


Q—20 dB Rg 51 0.0 3.9 387.3 52.9 5.9 2.4 0.66 
Q—25 dB Ry 50 32.0 34.0 26.0 8.0 0.0 3.9 0.94 
Q—25 dB R3 51 7.8 39.2 39.2 13.7 0.0 3.4 0.82 
Q—25 dB Rg 51 9.8 13.7 43.1 27.5 5.9 2.9 1.02 


Notation: 

PCM = 48-kb/s 15-segment PCM (mid-tread) 

NIC = 51-kb/s NIC PCM 

ADPCM = 48-kb/s ADPCM (with step-size leak) 

SLC = 48-kb/s SLC ADM “ 

hy Io, I, I4, and Is = input levels of —0.2, —10.5, —20.7, —31.0, and —41.2 VU, respec- 
tively 

: ae Re, R3, and R, = received volumes of —20.7, —29.0, —38.3, and —47.0 VU, respec- 

tively 


Table VIlI—Detailed tabulation of subjective scoring 
for single encoding vs error rate tests 


Error % % % % % 

Codec Rate Votes Exc. Good Fair Poor Uns. MOS o 
PCM 1071 207 0.0 0.0 0.0 3.4 96.6 1.0 0.18 
PCM 10-2 204 0.0 2.0 8.3 63.2 26.5 1.9 0.64 
PCM 1073 207 0.5 17.9 66.7 15.0 0.0 3.0 0.59 
PCM 10-4 207 31.4 41.1 26.6 0.5 0.5 4.0 0.80 
PCM 107-5 208 42.8 50.0 7.2 0.0 0.0 4.4 0.61 
NIC 10-1 208 0.0 0.0 0.5 7.7 91.8 1.1 0.30 
NIC 107-2 207 0.0 3.9 33.8 54,1 8.2 2.3 0.68 
NIC 1073 207 12.1 46.4 39.1 2.4 0.0 3.7 0:71 
NIC 1074 207 43.5 44,9 10.1 1.4 0.0 4.3 0.71 
NIC 10-5 208 52.4 41.8 5.8 0.0 0.0 4.5 0.60 

ADPCM _- 107! 207 0.0 0.0 0.0 3.9 96.1 1.0 0.19 
ADPCM 1072 206 0.0 1.5 16.5 65.0 17.0 2.0 0.63 
ADPCM 1073 207 22.7 45.9 28.0 3.4 0.0 3.9 0.79 
ADPCM 1074 208 35.1 52.4 12.0 0.5 0.0 4.2 0.66 
ADPCM 1075 208 41.8 44.2 13.9 0.0 0.0 4.3 0.69 
SLC™ 107} 206 0.0 0.0 3.4 27.7 68.9 1.3 0.54 
SLC 1072 206 1.0 17.5 55.3 25.2 1.0 2.9 0.71 
SLC 10-3 207 40.1 44,9 14.5 0.5 0.0 4.3 0.71 
SLC 10-4 206 46.1 44.7 9.2 0.0 0.0 4.4 0.65 
SLC 10-5 206 43.7 38.8 16.5 1.0 0.0 4.3 0.76 
Noise 10 dBrnC 201 31.3 50.2 17.9 0.5 0.0 4.1 0.70 
Noise 20 dBrnC 206 71.8 37.9 48.5 5.8 0.0 3.5 0.72 
Noise 30 dBrnC 205 3.4 14.6 53.7 27.3 10 2.9 0.77 
Noise 40 dBrnC 206 0.0 7.8 35.0 54.4 2.9 2.5 0.68 
Q—10 dB 208 0.0 2.4 11.5 69.7 16.3 2.0 0.61 
Q—20 dB 206 146 37.4 40.3 7.8 0.0 3.6 0.83 

Notation: 


PCM = 48-kb/s 15-segment PCM (mid-tread) 
NIC = 51-kb/s NIC PCM 

ADPCM = 48-kb/s ADPCM (with step-size leak) 
SLC = 48-kb/s SLC ADM 


Table Vill—Detailed tabulation of subjective scoring 
for number of codecs in tandem 


Tandem % % % % % 
Codec Encodings Votes Exc. Good Fair Poor Uns. MOS 
PCM 1 106 «35.9 ~=55.7 6.6 1.9 0.0 4.3 0.66 
PCM 2 105 41.9 45.7 11.4 1.0 0.0 4.3 0.70 
PCM 4 106-226 ©=6©60.4 8614.2 2.8 0.0 4.0 0.69 
PCM 6 106 «21.7 51.9) 23.6 2.8 0.0 3.9 0.75 
PCM 8 106 16.0 41.5 36.8 5.7 0.0 3.7 0.81 


(continued) 
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Table VIII (cont) 


PCM 1 106 14.2 52.8 31.1 1.9 0.0 3.8 0.70 
PCM, 2 106 «12.30 84.9 46.2 6.6 0.0 3.5 0.79 
PCMi 4. 106 0.0 15.1 585 26.4 0.0 2.9 0.63 
PCM 8 106 0.0 28 340 53.8 94 2.3 0.68 
NIC 1 106 7.5 51.9 33.0 7.5 0.0 3.6 0.74 
NIC ve, 106 28 349 425 19.8 0.0 3.2 0.79 
NIC 4 106 0.0 7.5 340 46.2 12.3 2.4 0.79 
NIC 8 106 0.0 0.9 13.2 48.1 37.7 1.8 0.70 
ADPCM 1 106 «226 8 8=641.5 9 31.1 4.7 0.0 3.8 0.83 
ADPCM 2 105 19 343 49.5 143 0.0 3.2 0.71 
ADPCM 4 106 0.9 85 49.1 39.6 1.9 2.7 0.70 
ADPCM 8 106 0.0 0.9 17.9 53.8 27.4 1.9 0.70 
SLC™ 1 104 279 40.4 26.0 4.8 10 39 0.90 
SLC 2 105 «138.38 = 45.7 33.8 6.7 1.0 36 0.83 
SLC 4 106 0.9 4104 44.3 42.5 1.9 2.7 0.78 
SLC 8 106 0.0 19 26.4 47.2 24.5 2.1 0.76 
ADPCM* 1 (at16kb/s) 106 0.0 1.0 18.2 41.5 44.3 1.7 0.78 
ADPCM* 1(at24kb/s) 106 0.0 11.3 528 32.1 3.8 2.7 0.71 
ADPCM* 1(at32kb/s) 105 29.5 41.0 25.7 3.8 0.0 4.0 0.84 
ADPCM* 1(at48kb/s) 106 26.4 50.9 21.7 1.0 0.0 4.0 0.72 
* ADPCM conditions (without step-size leak) which tie into those of Table V. 


Notation: 
PCM = 64-kb/s 15-segment PCM (mid-tread) 
PCM = 48-kb/s 15-segment PCM (mid-tread) 
NIC = 35-kb/s NIC PCM 
ADPCM = 32-kb/s ADPCM (without step-size leak) 
SLC = 87.7-kb/s SLC ADM 


Table IX—Detailed tabulation of subjective scoring 
for local reference connection tests 


% % % % % 

Connection Votes Exc. Good Fair Poor Uns. MOS a 
AL-ACO-AL 106 25.5 55.7 16.0 2.8 0.0 4.0 0.73 
AL-DCO-AL 106 22.6 55.7 18.9 2.8 0.0 4.0 0.73 
AL-ACO-P 106 9.4 57.5 29.2 3.8 0.0 3.7 0.68 
AL-ACO-P} 105 6.7 40.0 39.0 13.3 1.0 3.4 0.83 
AL-ACO-N 106 3.8 34.0 50.0 12.3 0.0 3.3 0.73 
AL-ACO-A 106. =15.1 37.7 41.5 5.7 0.0 3.6 0.81 
AL-ACO-S 106 4.7 51.9 38.7 4.7 0.0 3.6 0.66 
P-ACO-AL 105 17.1 51.4 28.6 2.9 0.0 3.8 0.74 
P,-ACO-AL 106 6.6 43.4 41.5 7.5 0.9 3.5 0.77 
N-ACO-AL 106 1.9 37.7 51.9 7.5 0.9 3.3 0.68 
A-ACO-AL 106 3.8 47.2 40.6 8.5 0.0 3.5 0.70 
S-ACO-AL 106 6.6 53.8 34.0 5.7 0.0 3.6 0.69 
P-ACO-P 106 10.4 55.7 33.0 0.9 0.0 3.8 0.64 
P1-ACO-P1 104 5.8 33.7 50.0 10.6 0.0 3.4 0.74 
N-ACO-N 106 1.9 29.2 36.8 27.4 4.7 3.0 0.91 
A-ACO-A 106 3.8 28.3 53.8 13.2 0.9 3.2 0.75 
S-ACO-S 106 0.9 26.4 59.4 13.2 0.0 3.2 0.64 
P-ACO-P 105 2.9 39.0 51.4 5.7 0.9 3.4 0.68 
P-ACO-N 106 0.9 27.4 54.7 17.0 0.0 3.1 0.68 
P-ACO-A 106 4.7 30.2 54.7 10.4 0.0 3.3 0.71 
P-ACO-S 105 2.9 55.2 38.1 3.8 0.0 3.6 0.62 
Py -ACO-P 106 5.7 45.3 38.7 10.4 0.0 3.5 0.75 
P,-ACO-N 106 0.9 17.9 61.3 19.8 0.0 3.0 0.64 
P -ACO-A 106 0.9 15.1 63.2 19.8 0.9 3.0 0.65 
P -ACO-S 106 1.9 33.0 50.9 14.2 0.0 3.2 0.70 
N-ACO-P 106 0.9 33.0 51.9 12.3 1.9 3.2 0.78 
N-ACO-P 1 106 2.8 17.0 48.1 29.2 2.8 2.9 0.82 
N-ACO-A 105 1.9 26.7 55.2 15.2 1.0 3.1 0.72 
N-ACO-S 106 0.9 32.1 50.9 16.0 0.0 3.2 0.70 
A-ACO-P 105 7.6 34.3 52.4 5.7 0.0 3.4 0.72 
A-ACO-Pj 105 3.8 28.6 48.6 18.1 1.0 3.2 0.79 
A-ACO-N 104 0.0 14.4 58.7 25.0 1.9 2.9 0.67 

(continued) 
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Table IX (cont) 


A-ACO-S 106 3.8 31.1 52.8 12.3 0.0 3.3 0.72 
S-ACO-P 106 4.7 44.3 47.2 3.8 0.0 3.5 0.65 
S-ACO-P1 106 0.0 14.2 55.7 29.2 0.9 2.8 0.67 
S-ACO-N 106 1.9 24.5 52.8 19.8 0.9 3.1 0.74 
S-ACO-A 106 0.0 18.9 67.0 14.2 0.0 3.1 0.57 
Notation: 


P = 64-kb/s 15-segment PCM (mid-tread) 
P, = 48-kb/s 15-segment PCM (mid-tread) 
N = 35-kb/s NIC PCM 

A = 32-kb/s ADPCM (without step-size leak) 
S = 87.7-kb/s SLC ADM 

AL = analog loop 

ACO = analog central office 

DCO = digital central office 


Table X—Detailed tabulation of subjective scoring for excha 
reference connection tests 


% % % % % 
Connection Votes Exc. Good Fair Poor Uns. MOS 
AL-ACO-AEX-ACO-AL 106 1.5 49.1 39.6 3.8 0.0 3.6 
P-ACO-P-ACO-P 106 5.7 37.7 54.7 1.9 0.0 3.5 
P1-ACO-P;-ACO-P, 106 3.8 23.6 40.6 27.4 4.7 2.9 
N-ACO-N-ACO-N 106 0.0 7.5 44.3 42.5 5.7 2.5 
A-ACO-A-ACO-A 105 0.0 12.4 61.0 21.0 5.7 2.8 
S-ACO-S-ACO-S 106 0.0 7.5 61.3 30.2 0.9 2.8 
P-DCO-P-DCO-P 106 36.8 50.0 13.2 0.0 0.0 4.2 
P1-DCO-P1-DCO-P, 106 14.2 60.4 19.8 5.7 0.0 3.8 
N-DCO-N-DCO-N 106 10.4 50.9 36.8 1.9 0.0 3.7 
A-DCO-A-DCO-A 106 15.1 47.2 32.1 5.7 0.0 3.7 
S-DCO-S-DCO-S 105 1.9 11.4 60.0 25.7 1.0 2.9 
Noise 10 dBrnC 212 20.3 56.6 23.1 0.0 0.0 4.0 
Noise 20 dBrnC 211 2.8 29.9 47.9 16.6 2.8 3.1 
Noise 30 dBrnC 211 0.0 8.5 30.8 46.9 13.7 2.3 
Noise 40 dBrnC 212 0.0 1.9 118, 44.8 41.5 1.7 
Q—5 dB 212 0.0 0.0 0.0 10.8 89.2 1.1 
Q—10 dB 211 0.0 0.9 9.0 39.8 50.2 1.6 
Q—15 dB 211 0.9 5.7 37.4 46.0 10.0 2.4 
Q—20 dB 211 5.7 30.3 47.4 16.1 0.5 3.3 
Q—25 dB 210 32.4 48.6 15.7 3.3 0.0 4.1 
Notation: 


P = 64-kb/s 15-segment PCM (mid-tread) 
P, = 48-kb/s 15-segment PCM (mid-tread) 
N = 35-kb/s NIC PCM 

A = 32-kb/s ADPCM (without step-size leak) 
S = 37.7-kb/s SLC ADM 

AL = analog loop 

ACO = analog central office 

DCO = digital central office 

AEX = analog exchange trunk 
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nge 


0.68 
0.63 
0.92 
0.72 
0.72 
0.60 
0.67 
0.73 
0.68 
0.79 
0.69 
0.66 
0.82 
0.82 
0.74 
0.31 
0.69 
0.78 
0.81 
0.78 


Table Xi—Detailed tabulation of subjective scoring 
for toll reference connection conditions—part 1 


% % % % % 

Connection Votes Exc. Good = Fair Poor Uns. MOS o 
AL-S1-AL -112 17.0 58.0 22.3 2.7 0.0 3.9 0.70 
AL-S1-P 102 19.6 50.0 30.4 0.0 0.0 3.9 0.70 
AL-S}-P} 111 12.6 42.3 36.0 9.0 0.0 3.6 0.82 
AL-S1-N 112 2.7 35.7 47.3 13.4 0.9 3.3 0.75 
AL-S1-A 112 10.7 44.6 38.4 6.3 0.0 3.6 0.76 
AL-S1-S 111 5.4 54.1 35.1 5.4 0.0 3.6 0.68 
AL-S9-AL 111 30.6 49.5 18.0 1.8 0.0 A.1 0.74 
AL-S9-P 112 18.8 56.3 23.2 1.8 0.0 3.9 0.70 
AL-So-P1 111 9.9 53.2 30.6 5.4 0.9 3.7 0.77 
AL-So-N 112 6.3 36.6 40.2 17.0 0.0 3.3 0.83 
AL-So-A 102 8.8 40.2 41.2 9.8 0.0 3.5 0.79 
AL-So-S 112 14.3 43.8 36.6 5.4 0.0 3.7 0.78 
AL-S3-AL 112 25.9 57.1 15.2 1.8 0.0 4.1 0.69 
AL-S3-P 112 30.4 50.0 19.6 0.0 0.0 4.1 0.70 
AL-S3-P1 102 11.8 43.1 41.2 3.9 0.0 3.6 0.74 
AL-S3-N 102 7.8 43.1 41.2 7.8 0.0 3.5 0.75 
AL-S3-A 102 4.9 43.1 41.2 10.8 0.0 3.4 0.75 
AL-S3-S 112 15.2 49.1 30.4 5.4 0.0 3.7 0.78 
AL-L}-AL 111 21 15.3 51.4 27.0 3.6 2.9 0.81 
AL-L1-P 112 1.8 19.6 49.1 28.6 0.9 2.9 - 0.76 
AL-Lj-P1 112 0.9 16.1 55.4 27.7 0.0 2.9 0.68 
AL-L1-N 102 1.0 10.8 50.0 37.3 1.0 2.7 0.70 
AL-L1-A 112 0.9 10.7 50.0 35.7 2.7 AT | 0.72 
AL-L1-S 101 1.0 20.8 51.5 26.7 0.0 3.0 0.72 
AL-Lo-AL 112 1.8 14.3 52.7 31.3 0.0 2.9 0.71 
AL-Lg-P 111 2.7 28.8 46.8 21.6 0.0 3.1 0.77 
AL-Lo-P1 111 0.9 13.5 55.0 30.6 0.0 2.9 0.67 
AL-Lo-N 112 0.0 13.4 51.8 33.9 0.9 2.8 0.68 
AL-Lo-A 111 0.9 9.9 57.7 29.7 1.8 2.8 0.68 
AL-Lo-S 111 1.8 21.6 54.1 22.5 0.0 3.0 0.72 
AL-L3-AL 112 17.9 62.5 18.8 0.9 0.0 4.0 0.63 
AL-L3-P 112 13.4 50.0 32.1 4.5 0.0 3.7 0.75 
AL-L3-P1 112 10.7 45.5 40.2 3.6 0.0 3.6 0.72 
AL-L3-N 112 8.0 43.8 41.1 7.1 0.0 3.5 0.74 
AL-L3-A 112 8.0 42.0 42.0 8.0 0.0 3.5 0.76 
AL-L3-S 112 5.4 39.3 47.3 8.0 0.0 3.4 0.72 
AL-W-AL 112 1.8 20.5 58.9 18.8 0.0 3.1 0.68 
AL-W-P 112 1.8 10.7 53.6 33.9 0.0 2.8 0.69 
AL-W-P 102 2.0 6.9 04.9 35.3 1.0 2.7 0.68 
AL-W-N 101 0.0 9.9 47.5 41.6 1.0 2.7 0.66 
AL-W-A 112 1.8 TA 51.8 39.3 0.0 2.7 0.67 
AL-W-S 112 0.0 10.7 55.4 33.0 0.9 2.8 0.64 
AL-D-AL 111 39.6 48.6 10.8 0.9 0.0 4.3 0.68 
AL-D-P 102 29.4 53.9 14.7 2.0 0.0 4.1 0.71 
AL-D-Pj 112 17.9 53.6 26.8 1.8 0.0 3.9 0.71 
AL-D-N 112 8.0 33.9 45.5 12.5 0.0 3.4 0.80 
AL-D-A 112 18.8 57.1 20.5 3.6 0.0 3.9 0.73 
AL-D-S 111 22.5 38.7 34.2 4.5 0.0 3.8 0.84 
Noise 10 dBrnC 224 44,2 46.0 9.8 0.0 0.0 4.3 | 0.65 
Noise 20 dBrnC 214 3.7 43.9 47.7 4.7 0.0 3.0 0.65 
Noise 30 dBrnC 214 0.9 15.0 48.6 34.6 0.9 2.8 0.73 
Noise 40 dBrnC 224 0.0 2.7 29.0 60.7 7.6 2.3 0.63 

Q—5 dB 223 0.0 0.0 0.0 21.1 78.9 1.2 0.41 

Q—10 dB 226 0.0 0.0 7.1 51.3 41.6 1.7 0.61 

Q—15 dB 223 0.0 5.8 37.7 43.9 12.6 2.4 0.77 

Q—20 dB 214 13.1 23.4 37.9 23.8 1.9 3.2 1.01 

Q—25 dB 224 32.1 43.8 21.0 3.1 0.0 4.1 0.81 
Notation: 


P, P,, N, A, S, AL—see Table X 
$1, Se, S3, Ly, Le, L3, W, D—see Figs. 19a, 19b, and 19c in text. 
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Table Xll—Detailed tabulation of subjective scoring 
for toll reference connection conditions—part 2 


% % % % % 

Connection Votes Exc. Good Fair Poor Uns. MoS o 
P-S1-P 108 40.7 50.0 9.3 0.0 0.0 4.3 0.63 
Py-S1-Pi 108 10.2 48.1 37.0 4.6 0.0 3.6 0.73 
N-S1-N 107 2.8 26.2 51.4 17.8 1.9 3.1 0.78 
A-S1-A 108 13.9 35.2 35.2 15.7 0.0 3.5 0.92 
§-S1-S 108 6.5 45.4 41.7 6.5 0.0 3.5 0.71 
P-So-P 108 23.1 64.8 12.0 0.0 0.0 4.1 0.58 
P1-So-P1 107 16.8 50.5 29.9 2.8 0.0 3.8 0.74 
N-So-N 108 6.5 23.1 55.6 14.8 0.0 3.2 0.77 
A-So-A 108 5.6 33.3 52.8 8.3 0.0 3.4 0.71 
S-S9-S 108 13.9 51.9 28.7 5.6 0.0 3.7 0.76 
P-S3-P 108 31.5 56.5 10.2 1.9 0.0 4,2 0.68 
P -S3-Pi 107 11.2 38.3 44.9 4.7 0.9 3.5 0.79 
N-S3-N 108 2.8 34.3 43.5 17.6 1.9 3.2 0.82 
A-S3-A 106 3.8 34.0 53.8 8.5 0.0 3.3 0.68 
S-S3-S 107 14.0 50.5 29.9 5.6 0.0. 3.7 0.77 
P-L1-P 108 0.0 13.0 59.3 27.8 0.0 2.9 0.62 
Py-L1-P1 108 0.0 10.2 59.3 30.6 0.0 2.8 0.60 
N-L1-N 108 0.0 8.3 60.2 28.7 2.8 2.7 0.64 
A-Ly1-A 108 0.0 6.5 54.6 38.0 0.9 227 0.61 
S-L1-S 107 0.0 4.7 57.0 37.4 0.9 2.7 0.58 
P-Lg-P 108 0.0 17.6 58.3 24.1 0.0 2.9 0.64 
P1-Lo-P1 108 ~=—s«O« 7.4 61.1 28.7 2.8 2.7 0.63 
N-Lo-N 107 0.0 4.7 57.9 35.5 1.9 2.7 0.60 
A-Lo-A 108 0.0 6.5 58.3 34.3 0.9 2.7 0.60 
S-Lo-S 108 0.0 5.6 60.2 33.3 0.9 2.7 0.58 
P-L3-P 106 17.9 60.4 21.7 0.0 0.0 4.0 0.63 
P1-L3-P1 108 12.0 49.1 33.3 5.6 0.0 3.7 0.76 
N-L3-N 107 1.9 27.1 56.1 15.0 0.0 3.2 0.69 
A-L3-A 108 6.5 35.2 50.0 8.3 0.0 3.4 0.73 
S-L3-S 107 2.8 47.7 42.1 7.5 0.0 3.5 0.67 
P-W-P 108 0.0 13.9 63.0 22.2 0.9 2.9 0.62 
P1-W-P1 108 0.0 8.3 59.3 31.5 0.9 2.8 0.61 
N-W-N 107 0.0 1.9 52.3 44.9 0.9 2.6 0.55 
A-W-A 106 0.0 2.8 57.5 37.7 1.9 2.6 0.58 
S-W-S 108 0.0 6.5 56.5 36.1 0.9 2.7 0.60 
P-D-P 108° = 387. 51.9 10.2 0.9 0.0 4.3 0.67 
P1-D-P1 106 18.9 43.4 33.0 4.7 0.0 3.8 0.81 
N-D-N 108 0.9 37.0 50.0 12. 0.0 3.3 0.68 
A-D-A 107 13.1 40.2 38.3 8.4 0.0 3.6 0.82 
S-D-S 108 17.6 50.0 31.5 0.9 0.9 3.8 0.71 
Noise 10 dBrnC 216 45.8 44.4 9.7 0.0 0.0 4.4 0.65 
Noise 20 dBrnC 215 8.4 52.1 36.7 2.8 0.0 3.7 0.67 
Noise 30 dBrnC 214 1.4 13.6 52.8 30.8 1.4 2.8 0.73 
Noise 40 dBrnC 216 0.0 0.0 24.5 58.8 16.7 2.1 0.64 

Q—5 dB 215 0.0 0.0 0.0 16.7 83.3 1.2 0.37 

Q—10 dB 216° 0.0 0.0 9.7 54.2 36.1 1.7 0.62 

Q—15 dB 215 0.9 9.8 38.6 46.0 4.7 2.6 0.77 

Q—20 dB 213 12.7 36.6 ‘41.8 8.9 0.0 3.5 0.83 

Q—25 dB 216 50.0 38.4 10.6 0.9 0.0 4.4 0.71 
Notation: 
See Table XI. 
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A Loss Model for Parabolic-Profile Fiber Splices 


By C. M. MILLER and S. C. METTLER 
(Manuscript received April 26, 1978) 


In the past, measurement results of splice loss of optical fibers have 
corresponded poorly to existing theory, which assumes a uniform power 
distribution across the cone of radiation defined by the local numerical 
aperture. In this paper, a model is developed in which a Gaussian power 
distribution across the local numerical aperture is assumed. Trans- 
mission through a splice at each point on the transmitting core is found 
to depend on the ratio of receiving to transmitting numerical aperture 
at that point. Numerical integration of these “point” transmission 
functions over core areas of interest yields both splice loss and the 
additional loss that occurs in a long fiber following the splice. This 
model cannot be theoretically rigorous, since it is inconsistent with 
boundary conditions required by the laws of light propagation. How- 
ever, it has been found to predict splice loss under varying conditions 
with much greater accuracy than existing theory. The model has the 
further virtue of being able to calculate how variations in many intrinsic 
and extrinsic splice parameters combine to produce an overall splice 
loss. 


l. INTRODUCTION 


Calculations for the loss in an optical fiber splice, as a function of 
offset, tilt, diameter, or numerical aperture mismatch, have been re- 
ported by several authors.!-° These calculations all assumed a uniform 
power distribution across the cone of radiation defined by the local nu- 
merical aperture (NA). This is consistent with assuming equal mode 
excitation, equal mode attenuation, and no mode coupling.* An as- 
sumption concerning the power distribution is necessary to characterize 
all combinations of both intrinsic and extrinsic splice imperfections. 
While these assumptions allow easy calculations, correspondence with 
measurement data has been unacceptable.* Gloge® reported results based 
on a diffusion process from the uniform power distribution to the 
steady-state distribution calculated by Marcuse.® Correspondence with 
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measurement data is much improved for the case of small offsets; how- 
ever, approximations used in the theory make it difficult to characterize 
other splice imperfections or splice loss for large offsets. 

This paper presents a phenomenological model with an assumed power 
distribution selected solely due to resulting correspondence of calculated 
effects of splice imperfections with measurement data. The model is not 
intended for uses other than splice loss characterization, and since the 
model does not obey the laws of ray optics exactly, caution should be 
exercised in any other uses of it. 


ll. DETAILS OF THE MODEL 


The power distribution across the cone of radiation defined by the 
numerical aperture at any point on a fiber core is assumed to be Gaussian 
in form. Figure 1 is a sketch of the assumed radially symmetric distri- 
bution across the cone of radiation from a given point on the fiber core. 
The model consists of solving for the transmission ratio in terms of 
transmitting and receiving numerical apertures, then integrating this 
ratio over the core areas of interest. 

Consider the steady-state power distribution across the cone defined 
by the local numerical aperture to be Gaussian in form, normalized to 
a value of l atr = 0. 


GP a p(r) = e777 120", (1) 
where o is proportional to the width of the Gaussian. 









r2 
p(r) = exp |—>- Inp 
genre 


Fig. 1—Gaussian power distribution. 
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Numerical aperture has been defined by various investigators to be 
the 1/e2, 0.1 or 0.01 power level. This power level, po, determines the 
width of the Gaussian and will be left as a parameter. Therefore, at 
Td; 


Do e—r07/202 (2) 
or p(r) = e(?/ro*)In po, (3) 


lil. NA MISMATCH 


The application of this simple model can best be illustrated by con- 
sidering the change in the power distribution as it propagates through 
a splice. The usual class of circularly symmetric index of refraction 
profiles* is used.throughout this paper. In this example, transmitting 
and receiving fiber profiles are identical except for the value of no. 
Therefore, the NA as a function of transmitting fiber core radius (p) is 


ays 
NA,(p) © No V 2A1 E = Gy lf * (4) 


where A, ~ (no; — n-)/no1 is small 
Noi = refractive index at center of core 
nN. = refractive index of cladding 
a = 2 for nearly parabolic profile fibers 
R = fiber core radius. 


The angle ¢1(p) (Fig. 1) is determined by NA;(p) to be 


o1() = sin“! [NA1(p)/n1(p)], (5) 
where 
p a 1/2 
n1(p) = N01 E — 2A; © ‘ (5a) 
The power distribution at a given value of p is assumed to be 
Pir) = e (r?/r1?)In Po, (6) 


where rj is proportional to tan ¢ (Fig. 1). 
For the receiving fiber, NAo(p) will first be assumed to be less than the 
transmitting fiber NA;(p), so that 


por) = e (r2/r2?)In PO (7) 


is the relative power distribution that corresponds to the same point on 
the receiving fiber. Figure 2 shows these two functions for rg < r; and 
for Gaussian distributions which are truncated at ro and rj, respectively. 
Using these truncated distributions, we assume the power in region [is 
lost immediately at the splice, since power in this region lies totally 
outside the receiving NA. Splice transmission through this point is 
then 
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\Y 
\ 


ly 







Lhe) 
"2 rm 
Fig. 2—Point transmission distributions. 


f, pa) rdr 
ees 


f. Pur) rdr 
0 






t(p) = (8) 





t(p) = [e(re/ri)7In Po — 1] (9) 


Po-l 
for ro <r. 
Since r; = k tan ¢; and re =k tan ¢2 where k is a constant of propor- 
tionality, then using (4) and (5), 
rg tan(sin7'V2A,9) | NAg (10) 
r, tan(sin-!V2A;) NA, 


for some given p. 
For this particular example, NAo/NA, is constant for every point (p) 
on the fiber core; therefore, eq. (9) gives the total transmission just after 
the splice for a truncated Gaussian distribution. 
For the case of nontruncated Gaussian distributions, (8) and (9) be- 


come 
{Pid rar + pole) rar 
r2 


J, ° Px(r) rdr 
0 


re\? 2 
t(p) =1+ () Do — e(2/rV"In po, (12) 
Py 


t(p) = (11) 
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for ro <r ;. For small values of po, (12) and (9) yield similar results. The 
full Gaussian is used to simplify later computations. 

Since this example contains radially symmetric distributions, there 
is no dependence on @. Equation (12) gives the transmission through the 
splice at a point for the steady-state power distribution assumed in (3). 
Again, for this example, (r2/r;)? is constant across the fiber core; there- 
fore, (12) gives the total transmission just after the splice for a numerical 
aperture mismatch. 


IV. COMPARISON WITH NA MISMATCH DATA 


Equation (12) can be compared to measurement data for a splice with 
an NA mismatch (re < ry, receiving fiber NA < transmitting fiber NA). 
This transmission coefficient represents an immediate loss at the splice. 
Figure 3 contains measurement data along with calculations for both 
a uniform power distribution and a Gaussian distribution. The param- 
eter, Do, is set to 1/e? and 0.1. As shown in Fig. 3, sensitivity to the value 
selected for po is significant only for large NA mismatches. 

Figure 3 contains data obtained using a HeNe and a GaAlAs laser 
source with a long input fiber (>500 m). Fibers selected for these mea- 
surements were well matched in O.D., core diameter, and a, but con- 
tained mismatches in NA. There were, however, slight differences in O.D., 
core diameter, and a on the order of a few percent, so that measured loss 
would be expected to differ somewhat from calculated values. 


O HeNe MEASUREMENTS 


GaA|As MEASUREMENTS (AVERAGE, 
RANGE AND NUMBER OF MEASUREMENTS) 
# 


UNIFORM POWER 


GAUSSIAN 


SPLICE LOSS IN DECIBELS 





Fig. 3—Loss due to numerical aperture mismatch. 
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Correspondence with measured data using either source is good 
compared to the uniform power distribution model which predicts too 
much loss. 

Before this model can be applied in general, the case of transmission 
from a smaller NA transmitting fiber to a larger NA receiving fiber must 
be considered. If, in Fig. 2, ro represents the transmitting fiber and r; 
the receiving fiber, then all the power contained in the cone of radiation 
defined by the transmitting NA is within the receiving NA. A unity 
transmission coefficient is assumed for this case. 


V. NEAR-FIELD POWER DISTRIBUTION 


The near-field power distribution as a function of core radius, p, can 
be calculated for the Gaussian model after a suitable weighting function 
is applied. Since a Gaussian distribution implies higher loss for higher 
order modes of propagation (the tails of the distribution), a weighting 
function is needed to reduce the amplitude of the Gaussian as a function 
of radius. This is required since only higher order modes of propagation 
are significant near the core-cladding interface. The weighting function 
assumed is the power distribution for the uniform power model. From 


(4), 


R 


for the case of uniform power across the cone of radiation defined by the 
numerical aperture where Po is proportional to A, no and input power.® 
With this function used as the amplitude at r = 0, then 


2 on (oe) 
P(p) © Po (1 = = f, f, e(r/ro2In po r dr dO. (14) 
R2 0 0 


Using (4) to obtain ro, 


P(p) © Po E 7 (2)'] (13) 


P(p) © Po(1 — p?)? (15) 


where all constant terms have been combined in Pp and R = 1. 


VI. COMPARISON WITH NEAR-FIELD POWER MEASUREMENTS 


Measurements of near-field power were made with a GaAlAs laser 
source after propagation through a long (>1 km) fiber wrapped under 
tension to simulate the effects of some microbending loss (~1 dB/km). 
These measurements were made using a 100X objective and a TV vidicon. 
camera. Figure 4 is a photograph of the camera output for a typical 
Western Electric fiber. An approximation to the curve was obtained by 
smoothing over the index dip and averaging power measurements for 
each side of the distribution. Calculations using (13) for the uniform 
distribution and (15) are plotted for comparison. The calculated 
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O MEASURED NEAR_ FIELD 


POWER DISTRIBUTION 
0.82um GaAlAs 


P (p) 





0 0.2 0.4 0.6 0.8 1.0 
p 


Fig. 4—Near-field power distribution. 


steady-state power distribution using the Gaussian model with the 
(1 — p?) weighting function is in excellent agreement with the measured 
distribution. 

VI. ADDITIONAL LOSS IN THE FIBER AFTER A SPLICE 


A parabolic index fiber splice is known to cause additional loss in the 
fiber after the splice.’ This additional loss depends on the differential 
mode attenuation and mode coupling characteristics of the receiving 
fiber and, for the fibers used in these experiments, is approximately equal 
to the loss at the splice for small offsets.’ Therefore, any realistic de- 
scription of an optical fiber splice must include this effect. 
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Referring to Fig. 2, the power in region II is seen to be within the nu- 
merical aperture of the receiving fiber; however, this power is improperly 
distributed. The sharp decrease in power at the edge of the receiving 
numerical aperture, re, is physically impossible; however, this effect is 
significantly reduced compared to the uniform power model. As this 
distribution propagates down the fiber length (1), we assume that a new 
Gaussian steady-state distribution will be generated, as shown in 
Fig. 5. Some portion, c, of the excess energy contained in region II of 
Fig. 2 will be lost in the process of reestablishing steady-state conditions. 
If energy couples to adjacent modes with equal probability and if all 
modes are equally excited and attenuated, then 1/2 of the power in region 
II would be lost during redistribution. Since the excess energy in region 
II is skewed toward higher order modes which may be lossier, c would 
be expected to be greater than 1/2. The loss mechanism is probably 
transfer of some of this excess energy to higher order propagating or leaky 
modes. 

Referring to Fig. 2, the equivalent transmission loss due to power lost 


from region II is . 
c | f° pe rdr — fpr) rar | 
0 0 
Ap) = SSS SS SS (16) 


f, _ P1(r) rdr 
0 


‘s (© )?Inpg 
Pgg(r) = Ae v2 





P (r) 


Fig. 5—Steady-state power distribution. 
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Evaluating this integral yields 


2 
At(p) = —ce(r2/r1)"n po + ¢ + ac) (pet): (17) 
ry 
To approximate c, we use the centroid for the solid of revolution for 
region II in Fig. 2. This places one-half the excess energy in the region 
0<r<c and the remaining energy between c <r < rg. This radius is 
taken as the value for c. From Fig. 2, 


f, * [pi(r) rdr — por) rdr] = ff ” Iplr) rdr — po(r) rdr]. (18) 


Solving the integrals yields the following nonlinear equation: 


2 
e (c/ri)?In po — G) e (c/r2)*In po 


ry 


2 
=! E + e(r2/r1)?In po — a (po + |. (19) 
2 ry 


This equation was solved for c/rg with 0 < re/r; < 1 for po = 0.1 and 
Po = 1/e?, and found to be accurately approximated by a quadratic 
equation. For po = 0.1, 


r9 2 r9 4 
c © 0.7994 — 0.08796 (2) + 0.00846 (20) 
ry 1 
and for po = 1/e?, 
2 4 
c = 0.8041 — 0.0724 () + 0.00636 ©) (21) 
ry ry 
The total effect of the splice is then 
ttot (0) = t(p) — At(p). (22) 
2 2 
tro(p) = (1-0) | 1+ po (2) etretvn ro] + e("2)', (23) 
ri ry 


where t(p) is the immediate point transmission coefficient at the splice 
given by (12) and At(p) is the reduction in transmission due to power lost 
in reestablishing a steady-state Gaussian distribution in a long fiber after 
the splice given by (17). Again, we call attention to the fact that the value 
of c probably depends on the differential mode attenuation and mode 
coupling characteristics of the receiving fiber. 


Vill. DIAMETER MISMATCH 


The necessary parts of the model have been developed so that the 
effect of diameter mismatch (Fig. 6) can now be considered. Let the 
transmitting NA function equal 


nike) Saia/on E _ Gu : (24) 
1 
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p 


Fig. 6—Core index of refraction profiles for the case of diameter mismatch. 


and the receiving NA function equal 


p \271/2 
NAo(p) = ng V2A E _ ) | ; (25) 
2 
where R; and Rg are the transmitting and receiving core radii. For K = 
R»/R, and Re normalized to unity, 


ro\2 1— p? 
i=) wy (26) 
rl | K2p2 

This ratio is not constant with p (except for K = 1), so that an integration 
over the receiving core is necessary. Distributions remain radially sym- 
metric; therefore, no angle dependence is present. 


f t(p) [1 — Kp?]? pdp 


Tot = , (27) 


1/K 
J, [1 — K2p?]? pdp 


where ¢(p) equals eq. (12) for the effect at the splice, or by ttot(o) [eq. 
(23)], to include the additional loss in the fiber after the splice. This in- 
tegral must be solved by numerical techniques. 


IX. COMPARISON WITH DIAMETER MISMATCH DATA 


Figure 7 compares the calculations of short-length-diameter mismatch 
effects at the splice with measured results for various values of K. Fibers 
were drawn from the same preform to insure that a and NA mismatch 
were minimized. A HeNe laser source was used with approximately 1 
m of fiber after the splice. 

The Gaussian power distribution model shows good agreement with 
measurement data. Long-length-diameter mismatch data are not 
presently available. 
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UNIFORM POWER 


# HeNe MEASUREMENTS 


(AVERAGE, RANGE AND NUMBER 
OF MEASUREMENTS) 


SPLICE LOSS IN DECIBELS 


GAUSSIAN 





0 0.05 0.1 0.15 0.2 0.25 0.3 0.35 
(Ry = Rr) / Ry =1-K 


Fig. 7—Loss due to core diameter mismatch. 


X. TRANSVERSE OFFSET 


Perhaps the most important extrinsic splice parameter is transverse 
offset (axial displacement). The Gaussian model can be applied to the 
case of transverse offset; however, some computational difficulties are 
encountered. Referring to Fig. 8, the area of overlap is divided into re- 
gion I, where the receiving NA is greater than the transmitting NA, and 
region II, where the receiving NA is less than the transmitting NA. In 
region I, t(p) is unity, therefore with R = 1, 


cos~!d/2 1 
So Soyo ece.g (E77? ead 
0 d/2 cos 
ja ET 


ft" ig (1 — p”)? pdpdo 


cos~!d/2 1 
f, ff vocon gf Dt — GF pdpde 
cos 
ee eee (29) 


f° f= 42 pdoas 


where q = p” — 2pd cos ¢ + a?. 


T (28) 


In region I], 
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Referring to Fig. 8, we note that the line separating regions I and II 
is a locus of equal NA; therefore, t(p) is unity on this line. On the curved 
boundary of region II, t(p) is zero; therefore, the derivative of ¢(p) is in- 
finite at the intersection of these lines.’ This point must be omitted and, 
since t(p) is steep in the vicinity of this point, a large number of incre- 
ments are required to evaluate eqs. (28) and (29) numerically. 


Ttot = Ty + Th. (30) 


If the loss at the splice is desired, (12) is used for t(p), and if the total 
loss including losses required to reestablish steady state is being calcu- 
lated, then (23) is used. 


XI. COMPARISON WITH TRANSVERSE OFFSET DATA 


Figure 9 is a comparison of transverse offset data and calculations 
(i) reported by Gloge,° (ii) using the Gaussian distribution assumption, 
and (111) using the uniform power assumption. Measurements were made 
with a GaAlAs laser source and an unbroken long fiber (2 km). After a 
reference level was established, the fiber was broken approximately in 
the center and the ends spliced to obtain the reference level again. Offset 
was introduced and the loss measured 1 km and 2 m after the splice as 
a function of transverse offset. Figure 9 shows the results for small offsets. 
Agreement among the Gloge model, the Gaussian model, and measure- 
ment data is good for loss at the splice. The Gaussian model also agrees 
well for the loss occurring in the fiber after the splice. 

Figure 10 shows results for large offsets. As compared with actual 
measurements, the Gaussian model understates the loss through a long 


LOCUS OF 
EQUAL NA 





t 1 
TRANSMITTING FIBER CORE RECEIVING FIBER CORE 


Fig. 8—Regions of overlap for offset fiber cores. 
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Fig. 9—Loss due to transverse offset. 


fiber after the splice for large offsets. The Gaussian model cannot, at 
present, account for this effect for large offsets. 


XII. CONCLUSIONS 


A point transmission model has been used to calculate splice loss 
due to NA mismatch, diameter mismatch, and transverse offset in 
parabolic-profile fiber splices. Near-field power distributions have also 
been calculated. Correspondence with measurement data is much im- 
proved as compared to calculations using the uniform power distribution 
model. 

The authors again emphasize that the Gaussian power assumption 
does not obey the laws of ray optics exactly and that this assumption was 
made primarily due to good correspondence of resulting calculations with 
measurement data and due to computational considerations. Future 
work will include the effects of a mismatch and the effects of long fibers 
after splices with combinations of intrinsic and extrinsic parameter 
mismatches. 
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O MEASURED SPLICE LOSS AT THE SPLICE 
O MEASURED SPLICE LOSS AFTER A LONG FIBER 


-—s=— GAUSSIAN DISTRIBUTION 
(po = 0.1) 


UNIFORM Y 
DISTRIBUTION 


TRANSMISSION IN PERCENT 





0 0.2 0.4 0.6 0.8 1.0 1.2 1.4 1.6 
OFFSET IN CORE RADIUS 


Fig. 10—Transmission vs offset for large offsets. 
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Optimum Reception of Digital Data Signals 
in the Presence of Timing-Phase Hits 


By D. D. FALCONER and R. D. GITLIN 
(Manuscript received March 29, 1978) 


Protection switching of digital radio channels results in a timing- 
phase discontinuity and occasional long error bursts in the demodu- 
lated data signal. Using an idealized mathematical model, we derive 
maximum likelihood receivers which rapidly track such delay hits, 
whether or not a timing-pilot tone is used. When the receiver is at a 
different physical location from the switch, the tracking algorithm must 
also sense the occurrence of a switch. A dual-mode, data-directed 
structure is revealed as being optimum; a narrowband tracking loop 
is used for steady-state operation, while a wideband tracking loop 
provides rapid recovery from the timing transient. An error-sensing 
nonlinearity, which incorporates hysteresis, inhibits erroneous noise- 
induced mode transitions. Oversampling of the demodulated data 
signal rapidly establishes a coarsely quantized, optimum sampling 
phase and permits the dual-mode tracking loop to operate in a data- 
directed manner. Data-directed operation permits greater loop 
bandwidths, since the data energy is not perceived as noise. Simulation 
of a digital data transmission system employing a dual-mode, data- 
directed, and coarse-quantized timing loop has demonstrated dramatic 
reduction in the length of error bursts following a protection switch. 
For example, at the data rate of 1.544 Mb/s, a conventional phase- 
locked loop with a 100-Hz bandwidth, when displaced a half-symbol 
interval by a delay hit, would typically sustain an error burst 15,000 
bits in duration. When such a delay hit stresses the dual-mode timing 
loop, simulation has indicated error bursts on the order of 15 bits in 
duratton. 
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I. INTRODUCTION 


Some channels used for data transmission exhibit occasional abrupt 
changes in their absolute delay. For example, during severe fading, 
line-of-sight microwave facilities commonly switch signals from their 
regularly assigned channel to a protection channel which, owing to dif- 
ferent filtering, cable lengths, etc., may impart a different delay. Such 
a switch, unknown to the receiver, changes the best phase with which 
a synchronous receiver should sample the incoming signal during each 
symbol interval. Until this new optimal timing phase is acquired by the 
receiver, data errors may proliferate if the delay change is a significant 
fraction of the symbol interval. The length of the succession of errors 
will be essentially inversely proportional to the bandwidth of the 
timing-recovery loop or filter. 

The object of this investigation is to determine and analyze signal 
processing structures which rapidly respond to a sudden change in timing 
phase (a delay hit) while also providing accurate steady-state timing 
information when the protection channel is not required. Based upon 
an idealized channel. model, we determine the maximum likelihood 
(optimum) receiver, and practically motivated approximations are made 
to provide realizable signal processors. The proposed receivers mediate 
the inherent conflict between using a narrowband timing recovery loop 
for steady-state operation, so that accurate and stable timing can be 
derived from the noisy received signal, and using a wideband loop to 
follow a timing-phase transient. As might be expected, the derived 
tracking loop is of the dual-mode variety; i.e., it automatically senses the 
state of the system (i.e., transient or steady state) and adjusts its struc- 
ture accordingly. The exact form of the loop depends on the detailed 
manner in which the disturbances are modeled; yet it is demonstrated 
that the essential features of the signal processors are quite robust and 
have significant intuitive appeal. 

In this study, our development is for an arbitrary protection-switched 
data communication system; however, specific simulation results and 
special emphasis are given to a 4-input level, Class-IV, partial response 
signaling format, such as used in the DUV system.! 

In Section I, we describe the system model—principally the statistical 
mechanism for generating a “delay hit.” The optimum receiver is de- 
scribed in Section III, and various suboptimum realizable structures are 
developed in Section IV. Digital implementation of these techniques in 
the partial response system is reported, via simulation, in Sections V 
and VI. 
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ll. SYSTEM MODEL FOR TIMING-RECOVERY PROBLEM 


In this section, we propose and develop a mathematical model for the 
data transmission system under consideration. Any attempt to exactly 
model the end-to-end data channel will be exceedingly tedious and 
probably fruitless, since the FM demodulator and phase-hit mechanism 
of most microwave facilities are highly nonlinear effects. Our approach 
is to isolate the major manifestations of a timing-phase discontinuity 
and background noise via a simple model, and then to apply maximum 
likelihood detection to obtain useful receivers. The validity of this ap- 
proach is measured by simulation of the receiver over a real channel. We 
begin by writing the transmitted baseband data signal? as 


s(t) = Na,h(t —nT) + pe sin 7, (1) 


where {a,} is a sequence of independent multilevel symbols, h(t) is a 
band-limited transmitted pulse, 1/T is the symbol rate, and p, is the 
parameter which indicates the power in.the pilot tone located at 1/2T 
Hz. The purpose of the pilot tone is to aid in providing the receiver timing 
phase and frequency. It will be assumed that the end-to-end pulse 
shaping used in the system is such that the desired sampling instants 
are t = nT. Whenever the pulse h(t) possesses more than the minimum 
Nyquist bandwidth, it is convenient to rewrite (1) as 


s(t) =X [an + o(-1)"Jalt — nT), (2) 


where it is recognized that £,(—1)"h(t — nT) is periodic with period 2T, 
and p, is the product of p and the energy in the pulse at 1/2T Hz. Since 
h(t) is customarily band-limited to less than 1/T Hz, only the funda- 
mental component of the signal 2,,(—1)"h(t — nT) will be transmitted 
through the filter h(t), thus the sinusoid may be represented by the al- 
ternating (dotting pattern) series. Indeed, in practice, a dotting pattern 
is frequently used to generate the tone. The transmitted signal may be 
rewritten as : 


s(t) = Lie,h(t — nT), (3) 


where 
Cn = An + p(—1)". (4) 


Recall that partial response signals can be generated by either digital 
filtering of the independent data symbols, {a,}, or by the use of special 
non-Nyquist pulse shapes. The receiver structures derived in the sequel 
will be discussed for both Nyquist and partial-response shaping. We now 
turn to the specific idealizations we will make to model the transmission 
path. 
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In the absence of any transmission distortion, the received baseband 
signal, r(t), can be modeled as 


r(t) = J[s(t)] + v(t), (5) 


where J[s(t)] is the time-jittered signal and where the additive noise v(t) 
will be taken as white Gaussian with spectral density No. For the purpose 
of analytical tractability, any instability in the timing phase will be 
modeled by representing the received signal as 


r(t) = >i c,h(t — nT — A,) + v(t), (6) 


where A,, is a random process whose characteristics will be described 
below. In the above model, which is shown in Fig. 1, the phase of the pilot 
tone is presumed to be jittered at the discrete instants, {nT}, in the same 
manner as the phase of the data signal. This is accurate when the pilot 
tone is generated via the dotting pattern method, and the timing in- 
stabilities arise solely in the transmitter clock. Any timing-phase jitter, 
A(t), that occurs during transmission should properly be modeled by 
r(t) =s(t — A(t)) + v(t). However, this leads to analytical difficulty in 
characterizing the statistical nature of the random process A(t), as well 
as having to contend with jitter-induced amplitude modulation of the 
received signal. With this caveat in mind, we lump all sources of 
timing-phase jitter into the model given by (6). Of course, the utility of 
the above model will be measured by the performance of the derived 
receivers in the real-world environment. 

The standard approach to the tracking of a slowly varying timing 
phase, and thus the timing frequency, is to use a narrowband filter 
centered about 1/2T Hz to extract the transmitted pilot tone. Of course, 
the bandwidth of this filter must be quite narrow to attenuate the in- 
band data-plus-noise energy. Extremely small effective bandwidths are 


{0} 






IDEALIZED CHANNEL 
WHICH PRODUCES 
TIMING HITS AT TIMES 


{29} 


r(t) 


r(t) => c, h{t—nT—A,) + p(t) 
Fig. 1—Idealized model of timing hits in a digital data transmission system. 
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achieved in practice by following a narrowband zonal filter with a 
phase-locked loop (PLL) whose voltage-controlled oscillator (VCO) is 
tuned to 1/27 Hz. However, the narrow bandwidth of the PLL will pre- 
clude rapid tracking of any sudden change in the timing phase. 

We digress momentarily to recall that the conventional envelope- 
derived timing scheme,** which does not utilize a pilot tone, will not 
provide timing information as the bandwidth of the system decreases 
to 1/2T Hz. It should be pointed out, however, that other nonquadratic 
techniques not requiring a pilot tone will provide a tone at the symbol 
rate for such minimum bandwidth systems; in particular, Saltzberg° has 
shown that the average of sgn[s(t)s(¢ — T)] provides a tone at 1/T Hz, 
and it is apparent that quartic® and similar operations will also provide 
the desired tone. 

Returning to the formulation of our system model, we let the dynamic 
evolution of the timing jitter be given by the difference equation 


An+1 = An + Wn + QnUn, (7) 


where {w,,} and {v,,} are sequences of mutually and self-independent 
Gaussian random variables with variances o? and py? respectively, and 
where p2 > a2. The variable a, is governed by 


_ (0, with probability 1 — po 


8 
1, with probability po 2) 


an 


where 0 < po « 1. Note that {A,,} is a Markov sequence where the mu- 
tually independent sequences {w,,} and {v,} model the steady-state and 
the transient (delay-hit) modes, respectively. The initial value Apo will 
be assumed to be uniformly distributed on (0,7). Clearly, most of the 
time there are no delay hits; i.e., a,, = 0, and the timing phase wanders 
about the correct value. Thus, po is the probability that a timing dis- 
continuity (which would follow a protection switch) occurs during a 
symbol interval. A typical sample path, or realization, of {A,,} is shown 
in Fig. 2, where the relative frequency of delay hits is determined by po. 
This simple two-mode model for the timing phase will be used to derive 
the optimum and various suboptimum data detectors, where an integral 
component of these detectors will be the timing-recovery loop. The 
steady-state jitter, w,, is incorporated so that the timing loop will con- 
tinually adjust the receiver’s timing phase; note that this mechanism 
allows the receiver to presume nominal knowledge of the timing fre- 
quency, and any inaccuracy or drift in this quantity will be compensated 
for by the timing-phase tracking system. 

With {A,,} specified by (7), the joint probability density function (pdf) 
of the {A,,} sequence is given by 
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TIME 


TIMING PHASE (4,) 





Mn+) =4n + War tan Van 


! 
NH 


Fig. 2—A typical timing-phase trajectory (a, = 1 signifies that a timing hit has 
occurred). ; 


p(A) = p(Ao, A1 +++, An) = p(Ao)p (Ai, Ag, +++ ,An| Ao) 
= p(Ao)p(Aj| Ao) p(Ag, +++ ,An| Ar, Ao) 
= p(Ao)p(Ai| Ao)p (Ag| Ao, 41) p (Az, * ++ ,An| Ao, Ar, Ag) 
= p(Ao)p(Ai| Ao) p (Ag| A1)p (Ag| Ag) - - p(An| An-1) 


= p(Ao) I p(Ai| A:-1), (9) 


where A = (Ao, Ay, «++, Ap). | 
Using (7) and (8), the conditional density is given by the mixture 


p(A;| Aj-1) = (1 — Po) e—(Ai— Aj-1)?/202 


Po (A: Az1)2/9( 2+ 2 
a (Aj Aj- ) /2( +o ) 
"Vim (u2 + 02172 ° ia 


and thus the joint pdf is given by (9) and (10) where Ao is distributed 


uniformly over (0,7). . 
Now that the system model has been specified, we turn to our pro- 
fessed goal of deriving optimum and suboptimum receivers. 
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iil, OPTIMUM RECEPTION OF TIME-JITTERED PAM DATA SIGNALS 


It is well known that the optimum (minimum probability of error) 
data-sequence detector maximizes the a-posteriori (MAP) probability 
density of the received signal with respect to the data sequence.* Thus 
the MAP receiver will supply the end-user with a sequence of decisions 
which maximize the probability density function p[{é,)}|r(t), 0 <t < 
T|, where the observation interval is (0,7). By virtue of (4) and the 
properties of MAP receivers, we may estimate G,, via dm = €m — p(-1)”, 
ie., the estimates of {c,,} and {a,,,} are related as above. Since all the data 
sequences {a,,} are equiprobable, the relevant probability density can 
be obtained by averaging over the jittered timing phases {A,}, i.e., the 
MAP density is proportional to 


P[r(t)|{am}, OS t ST] = Splr(t) fam}, {An}, 
0<t<T\]p[AldA, (11) 


where the conditional density in the integrand is given by the standard 
formula for the probability density functional of a known signal in white 
Gaussian noise, 


plr(t) {am}, {Am}, 0 <t < T]=kexp or ON, Jo ” ir(t) 


-~¥é,h(t —mT — A,,))dt}. (12) 
In the above equation, & is a constant independent of both {a,,} and {A,,}. 


The maximization of (11) with respect to {é,,}, or equivalently {d,,}, can 
be facilitated by writing (11) as 


7 
PiIr(t) {an}, O<t< T)]= kf ddexp |- = ae lide [r(t) 
2No 
—- > é,h(t — mT — A,,)|2dt — 2No In p(a) |}. (13) 
It can be shown that, in a high signal-to-noise-ratio environment |[i.e., 


as Ny — 0], the above integral with respect to A can be replaced by the 
maximum value of the integrand, i.e., as No > 0 


pir()llank 0<¢< T]~exp |---| (" [r(t) 


— > é,h(t — mT — A*,)]2dt — 2No In pa) ||, (14) 
m 
where {A*} is the maximizing sequence. Thus, under the asymptotic 


* The bit-optimum detector has been shown to be asymptotically approximated (at high 
signal-to-noise ratio) in performance by the optimum sequence detector (Ref. 7). 
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condition described above, the optimum receiver computes the joint 
minimum* of ; 


Alr(t)|{@m}, {Am}, 0 St < T] =—In p[r(é)|{an}, 
{Am}, 0 <t<T]—2Nolnp(A) (15a) 


=f [r@ -xenhle- mT - An) | dt 


— 2No | > Inp(Am|Am-1) + In p(Ao) | (15b) 
m=0 


= Ay[r(t)|{@m}, {Am}] + AclAmI], (15c) 


where A,[-] and A.[-] are defined in the obvious manner from (15b). For 
convenience in notation, we drop the d,, and A,, symbols in favor of a,, 
and A,, whenever there is no possibility of confusion. We also adopt the 
notational shorthand 


Lflam}, {Am}] = Alr(t) lam}, {Am}, 0 Sts T], (16a) 
= Li[lam}, {Am}] + @altAmnd], (16b) 


where the £; corresponds to the appropriate A; (i = 1, 2) in (15c). Thus, 
our task is to jointly minimize é[{a,,}, {4,,}] with respect to the 
discrete-valued variables {a,,} and the continuous-range variables {A}. 
Since f@ r2(t)dt is independent of {c,,} and {A,,}, the relevant portion 
of £1[{am}, {Am}] is given by 


Ly[fam}, {Am}] = —2 © emz(mT + Am) 
° ay » Cee (m= RYT FP Age Ag); AD) 
where the matched-filter output z(t) is given by 
2(t) = f _ h(t’ = t)r(e/)de' (18) 


and 


g(t) = f- ; h(t/)h(t + t’)dt’ (19) 


is the channel correlation function. Thus the sufficient statistics are the 
set of matched-filter output samples {z(mT + A,,)}, where the sampling 
phases {A,,} are still to be determined. The other component of the 
likelihood is given by 


.,° It should be clear from (15b) and (10) that, in the absence of a noise or timing-phase hit, 
A;, > An and Gp, — an; i.e., the estimates tend to the true parameter values. 
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Lo[{Am}] = —2No In p(Ao) 


~ INo = In (Fane Ger tn ile 


To 


4 0 6 (An Am-)2/2H2+ 0) (2) 
a (u2 os o2)1/2 © 


and the optimum receiver minimizes @ = €, + @, where @; and 5 are 
given by (17) and (20), respectively, with respect to {a,,} and {A,,}. Op- 
timization with respect to {A,,} is via differentiation and gives 


(Ai — Ao)G[Ay — Ao] + ve Clan}, {Am}) = (21) 


and 


(Arti — Ap)G[Agti — Ap] — (Ap — Ap-1)G[Ag — Ag-i] 





O 
+ — nits 1 in = 0, k=1,2,--- 99 
Say Lalla [Ami] = 0 (22) 
where the function G]-] is defined by* 
(1 — po) 5 . 
At Pol = 92 ree Se —7 2/242 
Gtx) =| ¢e2No exp {9/209 + ong) OP eH 
p= Po exp {—x?/207} + ze exp {—x2/2yu?} 
o ML 


(23) 


As we see in the next section, the nature of G[ | will impart a dual-mode 
character to the various tracking loops described in the sequel. It is 
convenient to define the weighted differential-epoch 


nh = (A, — Ap-1)G[A, — Az-il, (24) 
and (21) and (22) can thus be written as 


Of; 
= ee At pers k=0,1,2,---, 25 
The = Tk a [lam}, {Am}] (25) 


where 7 = 0. 

Several difficulties associated with the “iteration” prescribed by (25) 
preclude incorporation in a realistic detector: (¢) as already mentioned, 
Ao is unknown, (ii) as it stands, the optimization over Ay, is for a given 
set of {a,,}, (vii) from (17) it is clear that 02;/0A; depends on all the {a,,} 
and {A,,}, and (iv) optimization of (17) with respect to the {a,,} requires 
a Viterbi-related dynamic programming algorithm.® (The state size is 


* We have assumed that » > o so that uw? + o? = p?. The detailed nature of the function 
G{-] is discussed in the next section. 
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somewhat ambiguous, since the presumably finite memory of g(t) is 
enhanced by the arbitrarily large size of A, — A,.) 

Because of the above factors, the level of complexity associated with 
the optimum receiver is prohibitive, and thus even for our simplified 
model we must resort to suboptimum reception. In a sense, this is not 
surprising since the maximum likelihood receiver has at its disposal the 
entire observation record, and it is only in special cases that the optimum 
procedure can be implemented in a sequential manner. 


IV. SUBOPTIMUM RECEPTION 


In this section, we indicate several reasonable receivers suggested by 
the optimum receiver of the previous section. 


4.1 Data-directed receiver 


Our approach to deriving a useful suboptimum receiver is to remove, 
via approximation, the difficulties associated with implementing the 
optimum receiver—the principal simplification we will make is to take 
a decision-directed approach. We begin by noting that (25) would bea 
practical and realizable recursion if: (i) 02,/0A; depended only on AR 
and d,, and (ii) the optimum value of 4, depends only on z(kT — A;) 
and A;,. With these desiderata in mind, we note from (17) that 


on [lam LAr] = ~2ene(RT + By) ~ 


x z énlg((k — n)T — Ap + An) — 8((n —k)T + An — Ap)] (26) 
n# 


where the “dot” indicates the time derivative. The first concession we 
make to realizability is to neglect the second term in (26). Note that if 
a tone is not transmitted and the data levels are uncorrelated, then the 
expected value of this term is zero. We realize, of course, that this term 
would make a contribution whenever a timing-phase hit occurs; however, 
we are relying on the @[-] component of the likelihood to provide the 
dominant indication of this event. With this approximation, (25) reduces 
to 


Nh+i = Me + 26p2(RT + Ax), (27) 
and from (24) we have 
mk+1 = (Apgi — Ap)G[(An+1 — Ax). 


The value of Ay+; may be generated from n+; and A; via the inverse 
relation 


Anti = Ap + FO metal, (28) 
where if 7 = xG[x] = F[x], then F-1[ ] is defined by 
x =F-1[y]. (29) 
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Thus (27) and (28) provide a second-order system of iterative equa- 
tions for generating the desired estimates of {A,} provided: (i) that either 
the sequence {c;} is known or reliable decisions are available, and (ii) that 
the initial phase estimate Ay is known. These equations may be viewed 
as a second-order, discrete-time phase-locked loop (PLL) with a non- 
linearity F—1{ | necessitated by the dual-mode nature of the timing 
phase. 

The function F-![x] is plotted in Fig. 3 for pp ~ 0 and a2 « p?. Note 
that this function is odd, exhibits hysteresis, and is multivalued over a 
certain range, and as shown in Fig. 3, F~1[y] can be approximated by the 
two straight-line segments 








0 Po_ Fb 
a < 90) ~ VFa¥/ loge (—PX—*) 
ON," In| Sm o 22 eee 
2 3 
F-\n) = ee > 7QDx VP 4/10 ( Po a) 
=) ONG ie TENT = po 0 
“a i2 
— nor —-— 7, M< < nf), 30 
oN, ON,” 7 In| Sn (30) 


With regard to the recursion (28), the parameters o2/No and p2/No can 


F-'(n) 





Fig. 3—The nonlinearity F—! [n] of (30). 
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be interpreted as a small and a large “step-size,” respectively. The 
magnitude of the step-size depends on the “old” n, and the “processed” 
observation ¢,2(kT + A;), and a typical trajectory of step-sizes is indi- 
cated by the arrows in Fig. 3. 

Returning to (17), we rewrite this expression as 


£i[{am}, {Am}] = —-2)) Cmz(mT + An) + > cf &0 


+ Ds » CmCpnE ((m a k)T + Ar a Am): (31) 
m#kRkR 


If the system pulse shape is Nyquist [g(n — k)T = go6,—x], then the third 
summation will be close to zero when the {A,,} are approximately equal 
over the duration of g(t). With this approximation in mind, we neglect 
the cross term (m_~ k) and complete the square to obtain 


£i{fam}, {Am}] =X {go (cm _ in}? : Za] 


m 80 8&0 
= (to (2m ~ [2- ot-u"])* 2 
and thus 
din = Q| = (—1)" , (32a) 
8&0 


where Q[ | is a function which quantizes its argument to the nearest 
symbol level. Observe that, with the assumptions we have made, the 
optimum value of d,, depends only on zm =z(mT + A,,) and is in fact 
the symbol level closest to (2m/go) — p(—1)™. The receiver sketched in 
Fig. 4 implements (27), (28), and (32a). 

If the system pulse shape is of the partial response type,? then a 
modified procedure is called for. We illustrate this technique when the 
received pulse g’(t) is a Class IV partial response pulse and the shaping 


A 
mT +tAy, z 
z(mT +A,,) 






A A 2 
Am+1=4m + F [nm+1] 


Fig. 4—Data-directed timing loop and receiver. 
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is split between the transmitter and receiver, the overall* characteristic 
being j2T sin wT’. In this case, 2, = £o(0x — 54-2) and, again neglecting 
the delay differences A; — A,,, the likelihood becomes 


£,[lam}, {An}] 2) Cel tan) oF £0 (x CF, = Cmem=2) 


(32b) 


Because of the coupling between c,, and c,,—2, the optimization proce- 
dure to determine the {é,,}, from (32b), requires the use of dynamic 
programming (the Viterbi algorithm’). While the Viterbi algorithm (VA) 
can be implemented in a rather straightforward manner for Class IV 
partial response systems, the decoding delay in the VA makes tracking 
of the timing phase rather unwieldy, and consequently practical receivers 
would probably employ a suboptimum technique which directly exam- 
ines the output of the receiving filter, 


2(RT) = > cng’ (RT — nT — A,,) + v(RT). (32c) 


In the above equation, the samples are obtained from the output of the 
receiver filter and, neglecting timing jitter, we have 


z(kT) = Cr — Cr-—2 + Ve = dp + Vk, (32d) 


where {d;} = {c, — cy—9} is the dependent or correlated data sequence. 
For example, if the input data symbols c, assume the values +1, +3, then 
d;, would be one of the seven output values 0, +2, +4, +6. Practical de- 
tectors would quantize z, to one of the seven allowed output values, and 
the desired data {¢,} is recovered from the relation ¢;, = d, — dz—2, where 
the data are typically precoded2 to prevent an erroneous decision from 
propagating. Note that the partial response waveform can be written 
either as (32c) or as 2,d,g(t — nT — A,,), where d,, are the correlated 
output levels and g(t) is the minimum-bandwidth symmetric Nyquist 
pulse, sin (rt/T)/(at/T). If we adopt this latter representation, then our 
maximum likelihood development can proceed as before—the only ad- 
ditional approximation being that, while the various sequences of {d,,} 
are not all.equally likely, we have implicitly taken them to be equi- 
probable. Thus, an approximation to the optimum receiver shown in 
Fig. 5 would be to quantize z,, using (32c), to the nearest output level 
and to use the corresponding dn in (27) and (28). 

Returning to Fig. 4, we recall that (27) to (29) have the appearance of 
a second-order, discrete-time, phase-locked loop with a bi-variable 
step-size. The choice of step-size is dictated by the current value of n;, 
which provides a measurement of the “jump” A, — Ag- 1. A large value 
of nz+1 is indicative of a large jump, while a small value of nz, +1 reassures 


* Note that, in this case, g’(t) is antisymmetric and the receiver filter is not matched to 
the transmitter filter. 
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Nm+1 =m + 2d_ z(mT+Am) 






A A re 
Amit =Am + Fo! Inmet] 


Fig. 5—Data-directed timing loop and receiver for partial-response signaling. 


the tracking loop that its estimate of A; is close to the correct value. The 
nonlinearity shown in Fig. 3 is interpreted as providing hysteresis, since 
via (30) we know that in the range 7") < |n| < n® 


Arti = Ar + Br+inrti, (33) 


o2/No, if lne+1\ < n) and \nx| < n) 
u?/No, if [neil > n® or if [neti] > on and |[ne| > 1. 


The bi-variable step-size appearing in the tracking loop of Fig. 6 has the 
effect of adaptively varying the loop’s bandwidth and thus accelerating 
recovery from a delay jump. The omission of the quadratic term ap- 
pearing in (31) may prolong this recovery by several symbol intervals, 
but this is a small price to pay for the resulting simplicity in implemen- 
tation.* The receiver shown in Fig. 6 quantizes the filtered and sampled 
sequence with the aid of a decision-directed phase-locked tracking loop 


Br+i © 


A 
mT +Am 





aA. A 
Nm4+1 =m + 2Cm z(mMT+A,,) 


A A 
Amt =Am + B41 i m+1 





7 { 07/No, if IMmay 10!) and Im |< n'?) 
p27 No, if Itmaq (> 2) or if Inmag) > a! and Iam | > 9!) 
Fig. 6—Simplified data-directed receiver. 
* Omission of the quadratic term is tantamount to neglecting the amplitude transient, 
caused by the delay jump, which propagates through the channel and receiver filters. In 


other words, if one accepts the model given by (6), then any amplitude transients are im- 
plicitly neglected. 
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which provides the sequence of sampling phases, given knowledge of the 
maximum-likelihood, initial-phase estimate, Ao. As described in a later 
section, the algorithm can be suitably modified to incorporate an esti- 
mate of Ap. The above decision-directed timing loop is similar to that 
described by Gitlin and Salz,® but the novel aspects here are the bi- 
variable step-size and the hysteresis associated with the tracking loop 
nonlinearity. 


4.2 Modifications to the decision-directed receiver 


A drawback of the receiver described in Section 4.1 is the possibility 
of a relatively long error burst following a timing-phase jump. Suppose 
such a delay jump causes a large deviation from the optimum sampling 
time; in a bandlimited system with a narrow eye-opening,’ this results 
in a large amount of intersymbol interference and consequently a high 
probability of error in the next symbol interval. The resulting incorrect 
decision, used in the decision-directed timing recovery loop, may move 
the estimated sampling phase in the wrong direction, further increasing 
the intersymbol interference. This type of effect is called runaway and 
is possible in nearly all decision-directed parameter tracking systems. 
Runaway is of particular concern in Class IV partial-response systems 
since the eye is open only for a small fraction of the symbol interval.1° 
The possibility of runaway is further enhanced by our neglecting the 
quadratic cross-term appearing in (31). 

To diminish the possibility of error proliferation due to delay jumps 
in a bandwidth-limited system, we propose the coarse-quantized timing 
recovery system shown in Fig. 7. The incoming signal is sampled at times 
{mT + An + ((T/M), 0 <i SM — 1}, where M is some integer, i.e., the 
receiver samples are taken at the rate M/T instead of 1/T samples/s. The 
sampling phase is still controlled by a single tracking loop and, as before, 
after suitable filtering, each sample is quantized to the nearest data level. 
The number of samples M is chosen large enough that T/M is less than 
the width of the eye-opening corresponding to the pulse g(t). Thus for 
a system with an open eye,? in the absence of noise, at least one of the 
sampling phases {A,, + (iT/M)} will result in a correct output* decision. 
Expressed mathematically, for at least one integer “7,” the maximum 
possible interference, 


max >> 
{cm} m0 


’ 


iT 
emg (mT + km + aa) 
me M 
is less than the minimum distance between two possible received signal 
levels. 
The idea behind the increased sampling rate (which might be readily 








* For a four-input level, Class IV partial response system, recall that the output sequence 
is chosen from one of seven levels. 
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Fig. 7—Coarse-quantized timing recovery system. 






COARSE TIMING PHASE 
mT' 






available in a digital receiver) is that one of the M possible coarsely 
quantized sampling phases results in an open eye, and therefore in the 
absence of noise, supplies a correct sequence {c,} suitable for updating 
the decision-directed phase-tracking loop. The particular (coarse) timing 
phase chosen to supply the decision sequence used by the tracking loop 
is determined by reformulating the problem as picking the static timing 
epoch {iT /M} which maximizes the a posteriori likelihood over the recent 
past. This strategy is mechanized from (32a) by computing a running 


likelihood 
bese act k 2 
2 le? {Anbll =o [ef —<-2 (iT + a+ al 
j=k—-K £0 M 
1 k UT’ 
me gh A), (34 
£0 jek d 7M 


where K is some suitably chosen number, {c{} are the decisions? corre- 
sponding to the sampling phase 1T/M + Aj, and the decisions are ob- 
tained by quantizing the appropriate output sample. The coarse-timing 
phase, i*7/M, is chosen if £{) < é{ for alli ¥ i*. In practice, one might 
use asmall number of sampling epochs; e.g., three, which would bracket 
the correct phase. The sum in (34) is truncated to run over a finite span 
of duration KT seconds so that the effects of ancient delay jumps do not 
affect the current sampling epoch. Once the best coarse timing phase 


* The A; are supplied by the phase-locked loop driven by the decisions corresponding to 
the current most likely coarse-quantized timing phase. 
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is determined, the bi-variable step-size PLL previously described is used 
to determine the exact sampling phase. Whenever a new sampling phase 
becomes the most likely, the current estimate of the timing phase is in- 
cremented by the appropriate amount.* It should be pointed out that 
additive noise following a delay jump may prolong the recovery some- 
what. This effect is difficult to assess analytically, and a similar statement 
can be made concerning the size of M (the number of timing epochs) and 
K (the memory of the running likelihood). The sensitivity of system 
performance to these parameters is best determined experimentally. 

The realization shown in Fig. 7, which incorporates the coarse- 
quantized timing recovery scheme, has the related mechanization de- 
picted in Fig. 8, which is specialized to a Class IV partial-response system. 
Here the quantized seven-level outputs are computed for each sampling 
phase, and each sequence is monitored for partial-response violations. 
The coarse-quantized sampling phase used to control the timing tracking 
loop is chosen as the phase which has the fewest associated partial re- 
sponse violations. Again, the actual logic which dictates when and how 
switches to a new timing phase are accomplished is probably best de- 
termined by an experimental and/or simulation study of the actual 
system. 


4.3 A refined loop which estimates Ao 


As it stands, the bi-modal phase-locked loop described by (27) and 
(28) is initialized from a random or arbitrary initial condition, Ap. A 
consequence of this initialization is that in either mode (delay hit/no 
delay hit) the loop exhibits a double integration (or direct second dif- 
ference) structure. We now show that, when a constant step-size is used, 
the algorithm is potentially unstable. We begin by recalling that in ei- 
ther mode the tracking loop is governed by equations of the form 


Nk+1 = Mh + cp2(RT + Az) (27) 
Arti = Ap + Beart, (33) 
where @; is a positive nonincreasing sequence.. 
i 

mT! +4 PICK OUTPUT 
r(t) 1 poereac st SEVEN-LEVEL o—{ER DECODER = NE WES) | ss TIMING 
nae COMMUTATING . P.R. VIOLATIONS PHASE 

SWITCH IN LAST K BITS 


P. R. DECODER 
M 


Fig. 8—Coarse-quantized timing recovery for partial-response system. 


t For example, suppose ¢{!) was the maximum and the new maximum is ¢{!"), then the 
timing phase should be incremented by (i* — i) T/M s. 

t This instability can be regarded as a manifestation of the sensitivity of the system 
equations to the initial unknown phase; i.e., the effect of a wrong choice of this phase 
propagates endlessly. This is a consequence of viewing the system of simultaneous equa- 
Hone for the timing-phase estimate, (27) and (28), as a recursion with an arbitrary initial 
condition. 
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For the purpose of this discussion, we consider a fixed but unknown 
delay, A, and examine the average values of the above equations, i.e., 


Nk+1 = Te + S(Ap — A) (35a) 
Anti = Ar + Beiney (35b) 


where the overbar denotes expectation. We can combine the above 
equations to obtain the recursion 








Keri ~ (1+ Be ) +P a, =peGr- A), — (36) 

Br-1 Br-1 

and if we denote the tracking error by 
ep = Ar —A F (37) 


then we have 





€h+1 — (1 + i é, + oe ~ B,[g(0) + «&(0)] 


= Br&(O)ex, (38) 


where we have used a Taylor Series expansion which is valid for small 
ex. Note that the solution to the above time-varying difference equation 
will decay when the product of the “instantaneous roots,” 8;/8;—, is less 
than unity. However, if we were to use a constant step-size, i.e., Bp = 
Br—1 = B, and if Z(0) ~ 0, then the solutions are of the form ¢; = eg sin k6; 
i.e., the error does not decay to zero but oscillates as soon as the error 
penetrates the linear region (clearly, this is an unacceptable situation). 
The existence of oscillations can be deduced directly from (27) and (28). 
Note from (28) that A; is the accumulated sum of the past errors. When 
a phase-hit occurs, this sum will become large and the loop will enter the 
large step-size mode. In order that the loop ultimately converge to the 
correct phase, it is clear that the accumulator will have to “see” many 
terms opposite in sign to the original accumulants, i.e., the loop can os- 
cillate. 

In the light of the above discussion, we now derive an estimate of Ap 
and indicate how this estimate may be incorporated into the existing 
timing loop to produce a stable loop. We first let 





Dn = Din = AG; (39) 
and in terms of {A,,} we have from (20) 
® 1 — Po) ; 
fella] = No & Tog {SPO exp {Cn — Ain -1)/20% 
+ Po — exp {-(Ain, — Ain—1)?/2u}1, (40) 
be 


V2a0 
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where we note that Ao = 0; it is our intention to estimate {A,,} and {Ao} 
separately and then to combine these quantities to construct {A,}. 
Proceeding as before, we define 
Tk = (Ay — Ay—1)G[A, — Ay-il, (41) 
and taking the derivative of the likelihood with respect to A; gives 


, 5a Of; [{an}, {Aint, Ao] 
Vk+1 — Tk 


oA,,. 
= np + 2cpe(RT + A; + Ao). (42) 
Inverting (41) gives 
Anti = Ay + F-2[npsil, (43) 


where F-![ | has been previously defined and where (42) and (48) are 
initialized with Ao = 0. 

An estimate of Ap will be obtained by applying stochastic approxi- 
mation theory,!! and using as the increment the derivative of the current 
term in the likelihood, @;[{am}, {A;,}, Ao], with respect to Ao. 

The resulting stochastic approximation algorithm for the estimate 
of Ao is 

Aoe+1 = Aor = VRCRE(RT + A, + Aor) , k = 0, 1, 2, StS fy 
(44) 
where y; is a positive step-size sequence. Since the estimate of A; is the 
sum of the component estimates, i.e., 


A, = A, t+ Aor, (45) 


adding (48) and (44) gives the structure shown in Fig. 9, which imple- 
ments the recursions: 
Apei = Ap + Fo" [feel + veer (kT + Ap) (46) 
her = hk + 2epa(RT + Ap). (47) 
In implementing (46) and (47), the step-size y, would probably be 
switched to a larger step-size whenever the F'~![-] function indicates that 
a mode switch is taking place—this can be thought of as reinitializing 
the estimate of Ap. Contrasting (46) and (47) with (27) and (28), we see 
that, when the state of the system is such that F-1|n] = Gn, the latter 
system can be written as the second-order difference equation 


Apnt+1 — 2Apn + Ap—1 = Bep2(RT + Ax), (48) 
while the former system is equivalent to 
Apg—1 — 2Az + Ap-1 = (ve + B)ceZ(RT + Ax) 
— Yr-10n-12(RT — T + Ap-3). (49) 
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The effect of the direct feeding of the input, y,c,-12(kT — T + Ap-3), 
to the second summer in Fig. 9 can be seen by considering the evolution 
of the average phase error, (37). It is clear from (49) that with 7; = y the 
modified tracking-loop structure can provide roots within the unit circle 
and hence eliminate the possibility of oscillations. 


V. APPLICATION IN A SIMULATED CLASS IV 
PARTIAL-RESPONSE SYSTEM 

The application of dual-mode, decision-directed timing recovery and 
coarse-quantized timing recovery to a data communication system 
subject to additive noise and occasional delay jumps was tested by means 
of a computer simulation of a digital version of the baseband data 
transmission system shown in Fig. 10. Transmission through radio 
channels was modeled by the addition of additive white Gaussian noise 
to the signal and the insertion of abrupt delay changes. Since the simu- 
lated system is not an exact replica of the idealized equations used for 
analysis, the actual receiver differed in some small details from the 
structure previously derived. 

A seven-level, Class IV, partial-response waveform was generated in 
sampled form with sampling rate 10/T (10 times the symbol rate), and 
channel and receiver signal processing were also carried out digitally at 
this sampling rate. 

Because the simulation was carried out in nonreal time on a digital 
computer, exact realization of time delays of other than multiples of T/10 
was not possible. Moreover, the actual sampling phase of the over- 
sampled input was not under the receiver’s control. Instead, arbitrary 
channel and receiver sampling delays were approximated by linear in- 
terpolation. The samples at the output of the receiver filter were denoted 
{z(mT + iT’); m = 0,1, 2,-++,0;1 = 0,1, +--+ ,9}. 

The index i denotes a timing phase, quantized to a multiple of 
T’ = T/10. The output sampled at mT + iT’ + A,, was taken to be, by 
linear interpolation, 


z(mT + iT’ + An) ; 
= (1-)enT + iT) +E elm + G+ 1). (60) 
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Fig. 9—Modified data-directed, fine-tuned, timing loop. . 
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Fig. 10—Simulation of Class IV partial response system with delay hits. 


The quantity A, (0 < A,, < T”) is the receiver’s estimate of the sampling 
phase in the mth symbol interval, mod T’. 

The 10-fold oversampling also permits the application of the coarse- 
quantized timing recovery method described in Section 4.2, with M = 
10. Each samplet z(mT' + iT’ + A,,) is quantized into ¢ = FXG + p(—1)™ 
where d“ is one of the seven levels d = 0, + 2, +4, 6, and an error e) 
is formed as: 


e) =2(mT +iT’ + A,,) — 6. (51) 


For each integer 1 from 0 to 9, the sum of squared errors over the past 
K symbol intervals was formed 


The values of K used in the simulations were 20 and 40. 

That integer i = i* which minimized £{ was taken to be the current 
most likely coarse-quantized sampling PEnEE, Whenever i = 1* (once per 
symbol interval) the current decision ¢“) — p(—1)™ = d“) is passed on 
as the receiver’s decision on d,,. The program records the occurrence of 
errors (discrepancies between id “*) and dm). Note that the above defi- 
nition of £{? differs from that proposed in Section 4.2 in the omission 
of the sum of squares of z-samples. A further modification was the in- 
hibition of a change in i* when 2@{) is greater than 90 percent of @{"). This 
“dead zone” modification reduced the occurrence of switches back and 
forth between two values of i for which the values of @{!) are nearly 
equal. 

We remark that abrupt changes in the intervals between receiver 
output samples should not be passed on to the data recipient. The re- 
ceiver’s output samples would in practice enter an elastic buffer and be 
clocked out under the control of a very narrowband phase- pecned 
loop. 

The value of A,, used in the interpolative sampling procedure was 
obtained by a digital implementation of the decision-directed, second- 
order timing recovery algorithm described in Section 4.3. Instead of using 
the correction term ¢“") z(mT + A,, + i*T”’) in the loop, we use an ap- 
proximation to the nesative of the derivative (gradient) of the squared 
error e{")? with respect to A,,3 i.e., 


bm = —e&) [z(mT + (i* + 1)T’) — 2(mT + i*T’)]. (52) 


When the loop error is zero, the modified correction term (52) guarantees 
that no adjustment will be made, while the original correction term only 
provides this condition on the average. 


+ The overall impulse response was scaled so that the ideal sampled outputs in the ab- 
sence of noise are 0, 2, +4, 
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The average value of the correction term 6,, defined by (52) can be 
computed from equations (51), (82c), and (32d) on an ideal Class IV 
partial-response system for which 


_ (at . at —2T) 
emenie) Cr) 


(r)  (R) 


For small timing errors (An — A,, — i*T’) between the true phase A,, 
and the estimated phase A,, + 1*T” (that is, neglecting quadratic and 
higher order terms in (A,, — A,, — 1*T’)/T’), the linearized average value 
of 6, is 


(im) © 0.278 (“= (54) 


Am —i*T’ 
”) 
when a timing tone is not transmitted (p = 0). The corresponding lin- 
earized average value of the correction term (A 3(mT + Ay +i*T’)) 
can similarly be shown to equal this same quantity. Note that the cor- 
rection term 6,, given by (52) arises from an attempt to minimize the 
mean-squared error of a linear interpolation scheme applied to a digital 
receiver whose actual input sampling phase is not under its control. Thus, 
we have established a connection between this simple linear interpola- 
tion scheme with a mean-squared-error optimality criterion and the 
decision-directed, timing-phase recovery scheme dictated by minimum 
error probability considerations. 

As prescribed in Section IV, a second-order decision-directed, 
sampling-phase, updating algorithm including a direct correction term 
and a cumulative correction term is used. The following equations 
summarize the simulated receiver’s operation: 

(t) Interpolative sampling: 


z(mT + iT’ + Am) = (1 3) 2(mT + iT’) 


A 





). 
(tt) Quantization: 
d = quantization of [z(mT + iT’ + An) — p(-1)™] 
ce) = d® + p(-1)™. 
(iit) Error: 
eY=2(mT +iT’ + Ap) — 6. 


(tv) Coarse-quantized timing recovery: 
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1* = value of 1 which minimizes 


(P= FS eft 
m=k-K 
(vu) Decision-directed fine timing recovery: 
Post = a + ¥m6m + Bmim+1> 
where 
bm = ek” [z(mT + (i* + 1)T’) — z(mT + i*T’)] 
and 
tm+1 = (1 — @)nm + bm; 

with 


Ao = no = 0. 

Since in the simulated and real systems, the timing transient does not 
instantaneously affect the received signal, the recursion defining 7m+1 
introduces a small amount of “leakage,” represented by a = 0.0005. 
Suitable values of the second-order loop parameters y,, and 8,, in the 
narrowband and wideband modes were established by loop-bandwidth 
considerations and observations of the transient response of A,, to 
simulated delay jumps. The parameter values picked for the narrowband 
mode were 


Ym = 0.005 
and 
Bm = 3.42 X 10-6, 


Assuming the linearized average correction term of (54), we have a 
discrete-time linear model of the second-order fine-timing recovery loop 
shown in Fig. 11. This loop’s bandwidth, for the above values of y,, and 
Bm and 1/T = 772 kHz, is readily shown to be 240 Hz. 

The wideband mode is initiated first whenever the value of i* is 
' changed by the coarse-quantized algorithm, or second whenever the 
following recursively generated quantity exceeds a threshold: 


fo eee = 0.998 7» + Oi (55) 


The quantity S,, is a weighted average of past correction terms, in con- 
trast to the cumulative sum of all past correction terms envisaged 
in (47). 
With the initiation of the wideband mode, y, is set to 1 and 8, to 
3.42 X 10-4. Thereafter, 
Ym = max (0.005, 1/L) 


Bm = 3.42X10-4/L upto L = 200, 
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Fig. 11—Discrete time linearized, second-order loop model. 


where L is the number of symbol intervals elapsed since initiation of the 
wideband mode. Thus, the initiation of the wideband mode restarts a 
stochastic approximation algorithm, with step-size decreasing toward 
a fixed minimum value. The duration of the wideband mode is 200 
symbol intervals, after which the narrowband mode resumes. 


Vi. SIMULATION RESULTS 


The channel model and receiver structure described in Section V were 
simulated with a 24-dB signal-to-noise ratio and with the insertion of 
occasional delay hits. 

The value of K (the number of past squared errors stored by the 
coarse-quantized timing recovery algorithm) was set to either 20 or 40. 
Transmission both with and without a —18-dB (p = 0.554) 1/2T tone was 
simulated. The results are summarized in Fig. 12, which displays the 
observed average number of symbol errors (errors in d“*)) vs the delay 
hit expressed as a fraction of a symbol interval. 

Each average plotted in Fig. 12 is only over five delay hits of the same 
magnitude, and thus the curves display considerable variability. Nev- 
ertheless, it is clear that a receiver employing two-mode decision-directed 
and coarse-quantized timing recovery can tolerate delay hits of up to 
almost half a symbol interval, while sustaining error bursts on the order 
of a dozen or less, rather than several thousand, which might be expected 
in a conventional tracking loop with a bandwidth on the order of 100 Hz. 
Greater delay hits unavoidably cause the deletion or repetition of 
data. 

The number of errors sustained roughly doubled as K was doubled 
from 20 to 40. This is understandable, since the delay in detecting a phase 
change, by the coarse-quantized timing recovery system, is proportional 
to K. The risk of “false-alarm switching” decreases with K, and therefore 
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Fig. 12—Average number of errors produced by delay jumps in the simulated system 
employing two-mode, decision-directed and coarse-quantized timing recovery. 


a relatively large value of K such as 40 may be worth the price of, say, 
a dozen extra errors sustained per delay jump. On the other hand, de- 
creasing K will increase the number of errors due to “false-alarm” 
switching. The optimum value of K can best be determined by experi- 
ence with a real system. 

It is interesting to note from the curve that the presence or absence 
of a transmitted timing tone 18 dB below the data signal does not make 
a dramatic difference in the robustness of the system against delay hits. 
It therefore appears safe to omit the tone in a system employing 
decision-directed and coarse-quantized timing recovery. We note that 
the simulated system displayed rapid start-up characteristics in the 
coarse-quantized, decision-directed mode. The timing phase, correct 
to within 7/20, was acquired in 20 to 25 symbol intervals in the absence 
of a transmitted tone. The transmission of a —18-dB tone unaccountably 
delayed timing-phase acquisition during start-up. 

Figure 13 shows the evolution of the receiver’s sampling-phase esti- 
mate following a delay jump of —1.5T’. The horizontal coordinate is the 
number of elapsed symbol intervals. The dotted curves show the phase 
estimate i*7’ + A,, (quantized by the limited resolution of the computer 
plotting routine). The x’s at height 8.5 indicate the occurrence of symbol 
errors. In this example, the errors occur after the delay hit but before 
initiation of a coarse-quantized timing jump. 


3206 THE BELL SYSTEM TECHNICAL JOURNAL, NOVEMBER 1978 


10T’ 


be , 
Q OT 
ay 
[i 
Oo 
z 
a ERRORS 
= x xXx 
cag 
Ww 
[oaq 
lw 
> 
AY 
i OT" 
a 

77’ 

OT 10T 20T 30T 40T  50T GOT 70T  80T  Q9OT  100T 


TIME (t) 
Fig. 13—Response of system to delay jump applied at t = 0. 


Vil. CONCLUSIONS 


The technique of data-directed, coarse-quantized, dual-mode timing 
recovery has been derived and applied to the rapid acquisition of timing 
phase in systems subject to delay hits. In a.simulated system, typical 
error-burst lengths, following a timing discontinuity of up to a half 
symbol interval, have been reduced to a dozen or so—two orders of 
magnitude less than that expected with a conventional phase-locked 
tracking system. Furthermore, the derivation and simulations have 
demonstrated the viability of these timing-recovery techniques in the 
absence of a transmitted pilot tone. 
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Caustic Patterns Associated With 
Wielt Zones in Solidified Glass Samples— 
Part I: Symmetric Cases 


By T. D. DUDDERAR, J. B. SEERY, and P. G. SIMPKINS 
(Manuscript received March 17, 1978) 


Ray-tracing algorithms have been developed to follow the propaga- 
tion of a collimated beam of light traveling along and refracting out of 
a glass rod ina region of monotonically decreasing cross section. These 
algorithms have been used to study the formation and distribution of 
caustics as a function of the changing cross-section area. Axial profile 
data taken from the melt, or drawdown, zone of a solidified fiber- 
drawing sample provide the geometrical information needed to predict 
the loci of two major and two minor families of caustics. General 
principles for relating the observable far-field caustic patterns to the 
actual shapes of symmetric melt zones in glass samples are dis- 
cussed. 


I. INTRODUCTION 


When a plane light wavefront propagates along a cylindrical glass rod 
in which a rapid monotonic decrease in cross section occurs, some light 
may be refracted from the glass and become externally visible. For a 
sample with homogeneous optical properties, the amount of emerging 
light and its intensity distribution are strongly influenced by the rate 
at which the cross section decreases. In a previous study of melt, or 
drawdown, zones of solidified samples taken from a laser-heated fiber- 
drawing system, the boundaries between the regions of emitted light and 
shadow were seen to be loci of intense illumination properly identified 
as “‘caustics.”! These caustics were shown, by both experiment and 
analysis, to arise from various internal reflections and a refraction of the 
light from the surface. It was noted that the number of caustics increases 
as the rate of change of the cross section increases. No light is emitted 
from a very gradually tapered sample, while a great deal of light and 
numerous caustics are emitted from a sample with a very rapid taper. 
Also, as the rate of change of the cross section increases, the propagation 
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vectors of the light rays which form the far-field caustic patterns rotate; 
i.e., from the pulling or downstream direction through the radial and 
toward the upstream direction. 

During the initial investigation, ray-tracing algorithms were developed 
that permitted accurate calculation of the light paths necessary to 
identify the two principal caustics experimentally observed. These 
routines utilize actual melt zone profile data and polynomial spline- 
fitting procedures to provide the geometrical information necessary to 
describe the caustics for a given value of the index of refraction n. Ex- 
perimental results from four radically different samples compared very 
favorably with those obtained from the algorithm within the limitations 
of the accuracy of the profile data itself. 

The present study uses the algorithms to investigate the detailed re- 
sponse of the caustic loci to systematic changes in the melt zone geom- 
etry. Only rotationally symmetric homogeneous examples were con- 
sidered. Results were obtained for a far greater range of melt zone tapers 
than were originally investigated experimentally. 


il. PROCEDURE 


None of the four samples discussed in the earlier reports was in fact 
rotationally symmetric. Three were specifically selected because of their 
existing asymmetries. It was found that, for certain cases, these geometric 
asymmetries influenced the far-field caustic patterns quite strongly. In 
the present study, symmetric profile data were generated by averaging 
the least asymmetric coplanar profiles of the most gradually tapered 
sample (Sample 4 with 6 = 52.3 degrees). These data were then fitted 
by the same polynomial spline-fitting routine used earlier! to simulate 
a symmetric version of the original sample as shown in Fig. 1a. There, 
G is the angle between an axial ray and the outer normal to the profile 
at the inflection point J. Taking the slope of the outer normal as dx/dy, 
then 


6 = — arc tan dx/dy|,, 


where x is the axial location parameter and y is the radial location pa- 
rameter. The magnitude and location of the derivative at the inflection 
point are determined from the spline-fitting routine. 

It is worth comparing the calculated values of the caustic half-angles 
for the symmetric melt zone assuming n = 1.46 and the corresponding 
angles originally reported in Ref. 1. For example, the symmetric 
2-intercept caustic half-angle, 041, of 98.2 degrees, compares well with 
asymmetric half-angles of 86.8 and 104.7 degrees, for an average of 95.7 
degrees. Similarly, with the 3-intercept family, the symmetric data give 
a caustic half-angle, 6? = 148.5 degrees, whereas the half-angles of 
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MELT ZONE 


(a) (b) 


Fig. 1—(a) Symmetric profiles of Sample 4 showing coordinates and inflection point. 
Here 8 = 52.92 degrees, comparable to original data. (b) Symmetric profiles of Sample 
4 (as above) showing limiting ray paths for 2-intercept caustics with 04! = 98.2 degrees 
and for 3-intercept caustics with 64 = 148.5 degrees. 


138.1 and 154.5 degrees yield an average of 146.3 degrees from the original 
unsymmetric data.* 

The effects of changes in the taper of a symmetric melt zone were 
revealed by stretching one of the data coordinates by a scale factor before 
each calculation. With this procedure, increasing the scale factor pro- 
duces an increase in @ and hence a more gradually tapered melt zone. 
Conversely, decreasing the scale factor decreases 3 and increases the 
taper. The slope of the outer normal, dx/dy, at any point on the profile 
clearly varies linearly with the scale factor. 

To quantitatively determine the caustic half-angle, 04, as a function 
of geometry over the maximum possible range, over 130 differently scaled 
melt zones were analyzed. To establish the generality of these results, 
the analysis was repeated using data derived from the least symmetric 
and most sharply tapered of the original samples (Sample 3 with 
GB =~ 27.5 degrees). 


lll. THE CAUSTICS 


The two principal families of caustics are of primary interest because 
they appear over the greatest range of tapers. The first of these caustics 
is formed by light which reflects internally from a given side, crosses the 
axis of the melt zone, and is refracted out of the opposite side as shown 
in Fig. 1b. The second caustic family is due to light which makes two 
reflections on the initial side, then crosses the axis and is refracted out 
of the opposite side, as also shown in Fig. 1b. Hereafter, these first 


* All angles are measured from the upstream axial direction (see Fig. 1b), whereas in 
snes 1 caustic half-angles for the 3-intercept family were measured from the opposite 
irection. 
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and second caustic families will be referred to as “2-intercept” and ‘‘3- 
intercept,” respectively.* 

Two caustics of lesser interest are also briefly discussed later in this 
report. The first of these is a “1-intercept” caustic which, as its name 
implies, is refracted from the glass on its first interception with the 
surface. The second arises from light which, like the 3-intercept family, 
reflects twice from the first side before crossing the sample. However, 
its interception with the opposite side results in an initial reflection and 
it then refracts from the glass upon its second interception with that side. 
This is referred to as a ““4-intercept” caustic. 

Figure 2 presents a plot of the rays which are refracted from the 
drawdown zone in a sample with 6 = 69.3 degrees when 2-intercept light 
only is emitted. Illuminating rays propagating at greater radial distances 
than ray 1 or lesser radial distances than ray 3 intercept the second side 
at angles greater than the critical angle.t Consequently, they are con- 
tinuously internally reflected and propagate on down the fiber. All the 
rays between bounding rays 1 and 3 refract out. Ray 2 represents that 
ray which is incident at the point of maximum slope (labeled J in Fig. 
2) and is therefore turned through the greatest angle. That ray conse- 
quently forms a catacaustic, i.e., a caustic by reflection, within the glass. 
This caustic travels across the melt zone and forms a visible external 
caustic when refracted out on the opposite side. From Fig. 2 we see that 
rays originating on either side of ray 2 are refracted out at angles greater 
than the ray initially incident at the inflection point. 

It should be observed in Fig. 2 that the rays between 1 and 3 which are 
initially distributed evenly become concentrated near the caustic ray 




























































































_ Fig. 2—Symmetric profiles of Sample 4 scaled to give only 2-intercept caustics. The 
figure shows all trajectories for rays refracting out of the sample between bounding 
rays 1 and 8 on either side of limiting caustic ray 2. Here 6 = 69.3 degrees. 


* In Ref. 1, these were referred to as “upstream” and “downstream” caustics because this 
described their far-field propagation directions as observed in the first of the four original 
samples studied experimentally. 

t Taken as 43.2 degrees, assuming an index of refraction of 1.46 (at the reference wave- 
length) for fused silica. 
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as it develops. This concentration of rays symbolizes the intensification 
of light found along the far-field caustic loci associated with ray 2. 
Conversely, the rays become widely separated as the bounding rays 1 
and 3 are approached, representing a decrease in intensity. 

It will be seen that, as G decreases, other 2-intercept caustics appear 
which are not associated with the internal catacaustic formed at the 
inflection point. These are due to the refraction of an internal fan of light 
initially produced by reflection. When the final refraction causes the light 
to gather into a caustic, it is called a diacaustic. 

The 8-intercept caustics also involve an internal catacaustic, due to 
the interplay between the two initial reflections rather than from the 
inflection point. In this case, the internal caustic rays originate from rays 
propagating near the surface of the sample and finally emerge as the 
far-field caustic rays after refraction. To the best of our understanding, 
no other 3-intercept rays form externally visible caustics in symmetric 
melt zones of homogeneous glass. 


IV. RESULTS 


This section describes the development of the caustic field as a func- 
tion of the melt zone geometry for the fourth sample. The history of 
Sample 3 is similar. The reader who is not interested in specific details 
should proceed to Section V. 

We begin by considering melt zone examples with gradual tapers that 
emit little light and a single caustic. By systematically increasing the 
taper, we observe an increase in the amount of light emitted and corre- 
sponding increases in the number and complexity of the associated 
caustics. We interpret these results to give the reader a detailed under- 
standing of their significance. 

Figure 2 shows a typical distribution of rays throughout the sample 
including the caustic and bounding ray trajectories. Hereafter, for clarity, 
we show only the limiting rays (i.e., caustic and bounding rays). However, 
the reader is reminded that the ultimate intensity distribution is always 
nonuniform, being much brighter at a caustic and decaying severely as 
an extinction (e.g., by internal reflection) boundary is approached. 
Further simplification is effected by separating the graphical information 
as follows: The melt zone profile and the internal and external caustic 
rays as they appear are shown in Fig. 3. No other rays are shown. Figures 
4 and 5 are “polar plots” of all the far-field limiting rays, for the 
2-intercept and 3-intercept families, respectively. The scales are greatly 
magnified so that the collimated beam is represented as a single hori- 
zontal arrow propagating from left to right, and the sample profile is 
represented as a point. The far-field caustic rays are shown as solid lines 
and the bounding rays as broken lines. The circumferential arrows follow 
the continuous fan of light from the outermost bounding ray to the in- 
nermost bounding ray, including all caustic rays. 
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B = 69.3° 





(a) (b) 
B = 45.5° B = 39.8° B = 249° 
(d) (<) (f) 


Fig. 3—Symmetric profiles of Sample 4 showing internal and external ray paths for the 
limiting caustic rays as they appear. (a) 6 = 69.3 degrees. Only one 2-intercept caustic is 
present. (b) 8 = 60.6 degrees. Both 2- and 3-intercept caustics are present. (c) 6 = 45.7 
degrees. Both 2- and 3-intercept caustics are present. (d) 8 = 45.5 degrees. Only the 3- 
intercept caustic is present. (e) 8 = 39.8 degrees. 1-, 2-, and 3-intercept caustics are present. 
(f) 8 = 24.9 degrees. 1-, multiple 2-, and 3-intercept caustics are present. 


In Fig. 4, the 2-intercept rays are numbered sequentially beginning 
with the outermost ray 1, which originates near the surface, and in- 
creasing inward. The highest numbered ray represents that which ini- 
tially propagates nearest the sample core and ultimately refracts out.* 
The 3-intercept rays are lettered A through C or D, as shown in Fig. 5. 
Each of Figs. 3, 4, and 5 are repeated for a succession of scaled profiles, 
as shown. The complete range covered in this investigation extends from 
GB = 72 to B = 6.8 degrees. In Ref. 1, the corresponding range extended 
from 538 to about 31.3 degrees. 

Figure 3a is the same as Fig. 2, but with the bounding and intermediate 
rays omitted. The corresponding far-field caustic and bounding ray 
trajectories are shown in Fig. 4a for the 2-intercept family and Fig. 5a 
for the 3-intercept family (where the light simply propagates along the 
sample). Figure 3b shows the emergence of the 3-intercept caustic and 
a partially rotated 2-intercept caustic. (See also Figs. 4b and 5b.) The 
2-intercept caustic originates at the inflection point, as it always must, 
while the 3-intercept ray originates from a point close to the initial 
change in the sample cross section, as reported in Ref. 1. 

As 6 continues to decrease, both caustics rotate in an upstream di- 


* Actually, of course, the light is not made up of discrete rays but comprises a continuum, 
including rays on either side of the limiting rays shown. Because of the concentration of 
rays at the caustic, discreteness errors made in identifying the caustic angles are very small, 
while those associated with locating the extinction boundaries are substantially larger. 
The ray spacing used in the present study was optimized so that the caustic angles could 
be located within +0.1 degree, while the bounding rays are accurate to within no better 
than +4 degrees. 
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7 
6 = 45.5° ; B = 39.8° 





(d) (e) (f) 


Fig. 4—Schematic diagram showing the external limiting rays for the 2-intercept light 
paths. The symmetric profiles are omitted, but the orientation is the same as in Fig. 3. The 
horizontal arrow represents the incoming beam of collimated light illuminating the sample 
which in turn deflects the light into the plotted ray paths in the far field. Here, the solid 
lines represent caustics and the broken lines represent bounding rays limited by internal 
reflection. The circumferential arrows follow the continuous fan of light from the outermost 
bounding ray to the innermost bounding ray including all caustic limiting rays. The rays 
are numbered in sequence beginning with 1 as the outermost ray, which originates nearest 
the surface, and increasing inward. The unnumbered ray in Figs. 4e and 4f represent the 
last ray after ray 3 which is fully reflected on its first interception with the profiles (except 
the last two rays, which are again fully reflected). Figures 4a through 4f relate to the same 
geometrical parameters as Figs. 3a through 3f. 


rection, or opposite the rotation of the outer normal at J. Figures 3b and 
3c show this rotation quite clearly. Simultaneously, the fans of light 
forming both caustics broaden (see Figs. 4b and c and 5b and c). How- 
ever, there is an important distinction between these caustics which may 
be seen by comparing any two, e.g., Fig. 4c with Fig. 5c. The 2-intercept 
fan angle is larger and “evenly developed” about the caustic. That is, in 
Fig. 4c, the fan angle from ray 1 to ray 2 is comparable to the fan angle 
from ray 2 to ray 3. In contrast, Fig. 5c shows an “unevenly developed” 
fan of light for the 3-intercept caustic; i.e., the fan angle from ray A to 
ray B is much greater than the fan angle from ray B to ray C. The reasons 
for this can be understood by considering how the fans of light are formed 
internally. The rays forming the 2-intercept caustics are bounded by rays 
1 and 3 defined by the internal reflection conditions on the second in- 
cident side (see Fig. 2), and therefore give rise to an evenly developed 
fan. Hereafter, we shall refer to these bounding rays as extinction rays 
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B = 45.5° 6 = 39.8° 





(d) (e) (£) 


Fig. 5—Schematic diagram showing limiting rays for the 3-intercept light paths. Figure 
5 is Ste be the same as Fig. 4, except that the ray sequence is lettered rather than 
numbered. 


because the internal reflection causes extinction of the corresponding 
external illumination. However, for the 3-intercept family only the 
outermost bounding ray, A, is an extinction ray defined by the reflection 
condition on the second side. In contrast, the innermost bounding ray, 
C, is determined by the presence of the inflection point, which limits the 
distribution of possible double reflections. In other words, ray C is not. 
an extinction ray as are rays 1, 3, and A, but rather C is a bounding ray 
determined by the geometrical conditions required for double reflections 
on the first incident side. 

Figures 3d and 4d represent a change of only 0.2 degree from the 8 of 
Fig. 3c and show the abrupt extinction of the external 2-intercept caustic 
due to a second incidence angle in excess of the critical angle, 43.2 de- 
grees. In this case, the 2-intercept catacaustic ray is internally reflected 
back up the sample, as shown. In Fig. 4d, it can be seen that most of 
the 2-intercept rays are still incident at angles less than the critical angle 
and therefore are refracted from the glass. However, two separate fans 
are formed which are bounded by extinction rays on all sides: rays 1 and 
2 and rays 38 and 4. At the same time the 3-intercept caustic simply 
continues to rotate upstream as seen in Figs. 3d and 5d. As the taper is 
increased further, 6 reaches the critical angle at a minimum outer normal 
slope of 0.98, causing most of the light in the catacaustic ray to refract 
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from the sample on the first intercept. This results in the family of 
l-intercept caustics associated with the inflection point mentioned 
earlier. Since the optical mechanism by which the light is concentrated 
is refraction, this is also a diacaustic. Initially, the only light emitted is 
the ray at the inflection point which exits tangent at the surface (at a 
half-angle of 226.8 degrees) and follows the glass surface downstream. 
As £ increases, the amount of light refracted increases and the caustic 
ray refracts from the glass at decreasing angles. It is important to note 
that this caustic half-angle is always greater than 180 degrees; 1.e., it is 
the only caustic that always propagates toward the axis of the sample. 
At 6 = 40.7 degrees, a new 2-intercept caustic emerges. Here the two 
fans of light bounded by extinction rays are still present. In addition, 
there is a small bundle of rays, including a caustic ray lying between 
them. This caustic is termed a second 2-intercept caustic because it 
differs considerably from the original 2-intercept caustic, as may be 
understood from Figs. 3e and 4e. In Fig. 3e, the usual 3-intercept caustic 
can be seen originating from the outermost illuminating ray. At this 
point, it refracts out almost normal to the surface of the melt zone. 
Therefore, it depends only on the geometry of the melt zone and it is 
nearly independent of the magnitude of the refractive index. The in- 
nermost illuminating ray is incident at the inflection point; it gives rise 
to two caustics: the external diacaustic (the 1-intercept family discussed 
above) and the catacaustic associated with the original 2-intercept 
caustic which has now been reflected internally. A new caustic ray is now 
shown between these illuminating rays. This new ray gives rise to the 
new 2-intercept caustic heading upstream in Fig. 3e and is designated 
ray 4 in Fig. 4e. Then one obvious difference between the second 
2-intercept caustic and the original 2-intercept caustic is that the second 
2-intercept caustic is not generated from the inflection point. Figure 4e 
illustrates a second difference: namely, that extinction rays 3 and 5 
propagate at smaller angles than caustic ray 4, rather than at larger an- 
gles as before. Recall, for example, Fig. 4c, where extinction rays 1 and 
3 propagate at much larger angles than caustic ray 2. In other words, the 
new caustic is formed by a folding in the opposite direction. 
Simultaneously, the fan of light formed by the 1-intercept caustic 
broadens as the band of illuminating rays refracting at the first inter- 
ception broadens. This band is evenly developed about the inflection 
point ray, or J ray, and has now expanded to include the rays giving rise 
to the new 2-intercept caustic. Consequently, most of the light from rays 
forming this caustic has refracted at the first interception as part of the 
fan of light which forms the 1-intercept caustic; thus, the second 
2-intercept caustic is much less intense than the original. When the 
second 2-intercept caustic first emerges, however, the band of illumi- 
nating rays refracting at the first interception was so small that it did 
not yet include the rays contributing to the new caustic. This is the case 
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at 3 = 40.7 degrees, where this caustic was still relatively intense. The 
loss of intensity is illustrated in Fig. 4e by an unnumbered ray repre- 
senting the last one that is fully reflected on its first interception, shown 
between rays 3 and 4. Referring to fan 3, 4, and 5, the rays between 3 and 
the unnumbered ray are totally internally reflected on their first inter- 
ception and are therefore intense. But the rays between the unnumbered 
ray and ray 4, then continuing to extinction ray 5, all are less intense since 
- much light is lost on their first interception. Note that all other fans of 
light are totally reflected on the first interception and are also in- 
tense. 

Concurrently, it was found that the first extinction rays 1 and A rotate 
away from their original downstream directions as 6 decreases from 40.7 
degrees. This is not the case for final extinction ray 7, while extinction 
ray C continues to stay close behind the 3-intercept caustic as it also 
swings upstream. 

At 8 = 37.8 degrees, the 2-intercept catacaustic ray, partially reflected 
at the inflection point from the second side, has re-emerged propagating 
in the upstream direction. At this stage, it does not form an external 
caustic as it did earlier. This ray, referred to as the “J ray,” is not asso- 
ciated with any stationary point in the angular ray distribution and is 
located in the intermediate fan of rays. Though theoretically originating 
as a catacaustic limiting ray within the glass, its caustic character is 
momentarily lost to external observation because of the dominant effects 
of the severe spread in refraction angles approaching the extinction ray. 
Beginning with Fig. 4e, this unnumbered J ray appears between ex- 
tinction ray 3 and the new caustic ray 4, such that both the caustic and 
the J ray can be expected to be rather less intense. While the 1- and 3- 
intercept caustics rotate upstream in the directions of smaller 64s with 
decreasing @s, the new 2-intercept caustic rotates in the opposite di- 
rection—downstream toward greater caustic angles. In addition, its 
radial position in the illuminating beam has moved farther out from the 
center of the sample, away from the J ray and toward the illuminating 
ray for the 3-intercept caustic. This outward displacement of its illu- 
minating ray and its retrograde rotation are two other properties which 
make the second 2-intercept caustic different from the first. 

When @ is decreased further to 34.5 degrees, the first 2-intercept 
caustic reforms from the J ray, along with yet another 2-intercept caustic. 
The illuminating ray for the newest caustic propagates just inside the 
I ray and emerges in the far field barely downstream from the re-emer- 
gent first 2-intercept caustic ray. Both these rays are of reduced intensity, 
since they originate within the band of rays which lose most of their light 
by refraction on the first interception with the boundary. As the taper 
is decreased still further, a continued folding of this intermediate fan 
takes place, and the three 2-intercept caustics separate in the far field. 
The illuminating rays for the second and third 2-intercept caustics move 
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away from the / ray. At the same time, the illuminating ray for the third 
caustic moves in toward the surface. The third 2-intercept caustic, like 
the second, is folded in a direction opposite that of the first 2-intercept 
caustic and shows an initial rotation toward smaller 6, s. 

One additional point should be noted. In Fig. 3e, soon after the second 
2-intercept caustic appeared, its point of emergence moves out along the 
profile with decreasing @ and becomes stationary at the “start” of the 
melt zone, i.e., the location where the sample begins its decrease in cross 
section. At the same time, it continues retrograde rotation. On the other 
hand, the first 2-intercept caustic reappeared as soon as the emergent 
point of the J ray moved downstream to the same point, so that the outer 
normal angle becomes constant along the surface and the internal 
catacaustic could emerge intact with only a change in direction. Subse- 
quently, as @ is decreased, the point of emergence of the first 2-intercept 
caustic moves upstream along the sample surface where the diameter 
is constant, while the emergent points of both the second and third 2- 
intercept caustics merge together at the start of the melt zone. Ulti- 
mately, both the 3-intercept and the first 2-intercept caustics approach 
extinction by internal reflection, the latter for a second time as its 
propagation angle approaches 0 degree. Consequently, their rates of 
rotation seem to accelerate. From 8 = 24.9 degrees (Fig. 3f) to 6 = 22.3 
degrees, both original caustics disappear by internal reflection, leaving 
only the second and third 2-intercept and the single 1-intercept caustics 
in the emerging light field. Figure 4f shows the complicated 2-intercept 
far-field light ray pattern prior to extinction, with many overlaps be- 
tween the various fans of light contributing to the structure of the ob- 
served illumination. Here the first caustic, ray 5, is once again the ex- 
treme upstream ray while all other caustics and extinction boundaries 
are spread out in the light regions behind it. The unnumbered / ray still 
lies between ray 3 and the second 2-intercept caustic, ray 4, such that 
all of the 2-intercept caustics may appear relatively less intense than 
either the 1- or 3-intercept caustics. At this point, the third 2-intercept 
caustic, ray 6, has reversed its initial upstream rotation and, like the 
second 2-intercept caustic, assumed a retrograde rotation with changing 
slope. In Fig. 5f, 6 = 24.9 degrees, the fans of light behind the 3-intercept 
caustic can be seen to start to broaden slightly just prior to the disap- 
pearance of this caustic, while by @ = 22.3 degrees the caustic ray is in- 
ternally reflected and only the overlapping fans, bounded by extinction 
boundary rays, and the one geometrical boundary ray, remain. The two 
remaining 2-intercept caustics (with the original caustic extinguished) 
continue to rotate toward the downstream direction. 

At this geometry, we have probably passed beyond the practical limit 
for most melt zone profiles drawn in a laser furnace. However, if it were 
possible to draw even blunter profiles, we would observe that the second 
2-intercept caustic angle would approach 90 degrees while the 1-intercept 
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caustic angle would approach 180 degrees. At the same time, the inter- 
nally reflected 3-intercept caustic would reappear, at 8 =~ 16.4 degrees, 
although somewhat altered. That is, following the two reflections on the 
first side of the sample and the initial reflection on the second side, the 
family forms an internal catacaustic which refracts out of the sample on 
its next interception with the second side. Since this is a fourth inter- 
ception overall, it represents the emergence of a 4-intercept caustic rather 
than a re-emergence of the original 3-intercept caustic. Further blunting 
of the profile causes the caustic to rotate in the retrograde direction to 
a Maximum caustic half-angle of about 50 degrees, somewhere between 
6 = 14.0 and 8 = 18.4 degrees. It then reverses direction and disappears 
once more by internal reflection, at 6 = 6.8 degrees, leaving only the 
1-intercept caustic, which would be very bright, and the second and third 
2-intercept caustics, the first of which would also have become bright. 
Intensification of the light associated with the second 2-intercept caustic 
results from the outward migration of the associated illumination ray. 
It eventually reaches a point so near the start of the melt zone surface 
as to fall incident on the now rapidly curving profile at an angle greater 
than the critical angle. Consequently it is wholly reflected and all its light 
is transmitted across the sample to refract out and form the caustic. 


Vv. SUMMARY 


The variations of the angles of various 1-, 2-, and 3-intercept caustics 
as functions of 6 for both Sample 4 and Sample 3 are given in Figs. 6a 
and 6b. Significant values are also listed in Table I. In this study, we have 
identified two additional 2-intercept caustics. Comparison of Figs. 6a 
and 6b and the tabulated results show that, with the exception of these 
two caustics, the results for the morphologically different samples are 
numerically quite similar. Figure 6 also offers a comparison with the 
results of the original study.! For all four samples, these results compare 
very favorably. Since in Fig. 6 we are comparing results from symmetric 
samples with the average values obtained from often rather unsymmetric 
samples, this agreement is quite remarkable.* It is worth noting that in 
Ref. 1 both the experiments and the original analysis identified only two 
2-intercept caustics for Sample 3. In fact, there are three such caustics 
present. The taper of Sample 3 was such that the first and third 
2-intercept caustics were barely separated and consequently appeared 
to the observer as one. 

Turning to the more general results, we see that, for a shallow taper, 
no light emerges. When 6 falls below 73 degrees, the 2-intercept caustic 
emerges, initially directed downstream toward the x-axis and along the 
surface at an angle, 04 ~ 190 degrees. As the taper becomes steeper 


* It indicates that, while melt zone asymmetry may have a strong influence on caustic 
asymmetry, it may have relatively little influence on determining the total caustic cone 
angle, 204. 
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Fig. 6—Plots of the limiting caustic half-angle, 64, vs the angle of the outer normal at 
the inflection point, 8. (a) For symmetric Sample 4. (b) For symmetric Sample 3. The 
numbered data points represent averaged values of the caustic angles measured on the 
four original samples described in the earlier study (Ref. 1). Heavy lines denote bright 
caustic; light lines denote dim caustic. 


(8 decreases), the caustic ray swings toward the upstream direction. It 
reaches a half-angle, 04, normal to the surface at 8 ~ 58 degrees and a 
64 = 90 degrees at a 8 ~ 51 degrees. Thereafter, its rate of rotation ac- 
celerates as it approaches extinction at 04 ~ 52 degrees or 6 ~ 45.7 de- 
grees. Recalling that this caustic is the one formed at the inflection point, 
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Table |—Significant values of limiting caustic half angle 


Caustic 

Sample 2-Intercept 3-Intercept 

unction 4 3 ae a —_4__— same 
Station 6° On) p° Onl p° oR” p° OR 


A A A A 


At Maximum 8 71.7 189.6 72.7 188.5 61.3 186.7 62.6 184.0 
At 64 1 Surface 58.5 118.9 58.0 115.0 385 1160 41.6 120.0 
At 04 1 X-Axis 51.0 90.0 51.2 90.0 28.4 90.0 29.8 90.0 
At Minimum 8 45.7 51.9 45.8 52.3 23.7 00.2 25.7 67.2 


2-Intercept Third 2-Intercept 
pe 9 1° p° gyr g° 0 3° p° 0 Be 
At Maximum 8 345 25 5 631.0 46.5 034.5 By 5 o1.0 45 5 


. 4, : d : 
At Minimum 0,4 23.4 0.0 22.8 0.0 29.6 55.5 26.9 45.3 
.0)* (90.0) (0.0) _ (90.0) 


At Minimum £8 23.4 0.0 22.8 0.0 0 
econd + Intercept 1-Intercept 
ce B° gU2° Bo Ok pe - 
At Maximum 6 40.7 40.0 38.4 480 43.2 TIeB 43.2 eB 
At Minimum 8 (0.0) (90.0) (0.0) (90.0) 0.0 180.0 0.0 180.0 


* Numbers in parentheses were extrapolated and not actually calculated. 


it should be noted that, at 6 ~ 37.5 degrees, the internal catacaustic ray, 
or / ray, again emerges. However, it is not seen as an external caustic until 
GB < 34 degrees. A different 2-intercept caustic does become visible when 
68 ~ 40 degrees. This caustic is not associated with the internal caustic 
formed by reflection at the inflection point. It also differs from the 
original 2-intercept caustic in that, as the taper increases, it rotates in 
the opposite direction, i.e., toward greater angles. It may also emerge 
initially as an intense caustic, but it becomes quite dim once the inci- 
dence angle of the illuminating beam falls below the critical angle. When 
68 < 16.7 degrees, this new caustic ray is again illuminated by a ray ini- 
tially at an angle greater than the critical angle and so it again becomes 
bright. The half-angle, 64, of this caustic approaches 90 degrees 
asymptotically as 6 — 0 degree. Referring again to the J ray, we see that 
its rotation is also initially retrograde as 6 decreases. However, when it 
again becomes the leading 2-intercept caustic ray, its direction of rotation 
reverses. From this point, until its extinction at 6 ~ 23 degrees, the re- 
emergent 2-intercept caustic is much less intense. Simultaneously, a 
third, dim, 2-intercept caustic appears which ultimately rotates in the 
direction of increasing caustic angle, like the second 2-intercept caustic 
that preceded it. 

Careful study of Figs. 6a and 6b and Table I shows that over most of 
its range above 6 = 46 degrees the 2-intercept caustic 0,4 vs 6 relationship 
is very nearly the same for both samples. However, this is not true of the 
second and third 2-intercept caustics, which differ from the first in a 
number of significant ways. First, the far-field ray trajectories fold in 
opposite directions. Second, both new caustics rotate to larger rather 
than smaller caustic angles with decreasing 8. Third, they are indepen- 
dent of the internal catacaustic originating at the inflection point. This 
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latter observation is significant, because while the functional forms of 
both samples are similar, the additional caustics are quantitatively the 
most highly differentiated. This differentiability is a result of their 
originating from rays initially incident far away from the inflection point. 
There they are more strongly influenced by other aspects of melt zone 
morphology than just 6. More of this will be discussed later in this 
paper. 

The 3-intercept caustic is also less involved with the inflection point 
and shows some differentiation between the two samples over its entire 
range (see Table I). Since its initial emergent caustic angle depends on 
the downstream surface geometry in much the same way as it did for the 
original 2-intercept caustic, it is not surprising that it too should result 
in an initial caustic half-angle, 6,4, significantly greater than 180 degrees, 
although for a 10-degree smaller value of 8. As shown in Table I, the final 
3-intercept caustic angles are quite different. These extinction angles 
appear at Gs about 22 degrees smaller than their respective 2-intercept 
caustic’s initial extinction 6 angles and at very nearly the same final 
extinction 6 angles as those of the re-emergent first 2-intercept caustic. 
Actually, the light rays associated with the 3-intercept caustic also 
eventually make a reappearance. As shown in Fig. 6a for Sample 4, a 
bright 4-intercept caustic resulting from the internally reflected 3-in- 
tercept caustic does emerge briefly headed in the upstream direction and 
then disappears, all at 6s too small to be physically significant. 

There is a diacaustic which emerges when £ falls below the critical 
angle. This caustic is formed by the light which refracts from the surface 
on its first interception. The J ray which forms the internal catacaustic 
and, ultimately, the 2-intercept caustic, also attains an external extre- 
mum when it forms the 1-intercept caustic. Because little of the hight 
in the illuminating rays is reflected, once the 1-intercept caustic appears, 
many (but not all) of the 2-intercept rays which appear at Gs below the 
critical angle are quite dim. Consequently, over most of this range the 
2-intercept caustics are also dim. This is indicated by the light line 
weights used to represent them in Fig. 6. 

While the second and third 2-intercept caustics are unique in that they 
never propagate downstream or toward the fiber axis, the 1-intercept 
caustic is also unique because it always propagates downstream toward 
the fiber axis. Consequently, the 1-intercept caustic either follows the 
surface or reflects off the fiber at some station downstream, such that 
its fundamental rotation as a function of 8 depicted in Fig. 6 becomes 
reversed. Since this caustic depends only on 8, its angle is a unique 
function of @ for all melt zone profiles, 


6, = 180° — 8 + arc sin (n sin 8), 


where n is the index of refraction. Hence, for n = 1.46 it always originates 
following the surface downstream at an angle of 226.8 degrees at 6 = 43.2 
degrees and then rotates toward a limit of 180 degrees. 
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We conclude that the most useful caustics for studying fiber-drawing 
melt zones are the first 2-intercept caustic and the 3-intercept caustic. 
Between them, these cover a broad range of melt zone geometries. Except 
at very large @s, the first 2-intercept caustic depends mostly on the melt 
zone profile near the inflection point and does little to distinguish be- 
tween samples. In contrast, the 3-intercept caustics readily distinguish 
between the different samples. Sample 3 is on the average larger by about 
1.5 degrees in 6 than Sample 4, a difference originating from variations 
in geometry near the shoulder and heel of the profiles. Figure 7 presents 
plots of both sample profiles scaled to a common @ of 51.3 degrees. It can 
be seen that Sample 4 is significantly larger than Sample 3 both up- 
stream, at the shoulder, and downstream, at the heel of the profile. In 
addition, the inflection point of Sample 4 is at a greater radius, and up- 
stream, of that for Sample 3. Since the 8 angles are the same, these dif- 
ferences produce a less than 0.7-degree change in the 2-intercept caustic 
angle 04!. However, the 3-intercept caustic involves reflections at the 
shoulder region of the melt zone and a refraction at the heel region 
downstream, and it manifests a change in the caustic angle 6? more than 
three times that in 04". As reported in Ref. 1, this difference is detectable 
experimentally. 

Though less intense and of rather limited range, the second and third 
2-intercept caustics are also potentially useful as a means of studying 
melt zone geometries. These are the most distinguishable caustics be- 
cause they involve light which is initially incident either at the shoulder 
of the melt zone, as with the second 2-intercept caustic, or at the heel 


B = 51.3° 


=== 





U1 
9.21 = 9091°— CP 0,° = 144.40°) — — 
(0, 7" = 91.61°)—— 
6, = 91.25°— 
(6, = 91.779) —— 


Fig. 7—Superimposed symmetric profiles from Sample 4 (dashed lines) and Sample 
3 (solid lines) showing limiting ray paths for both 2- and 3-intercept caustics, one side only. 
Here, 8 = 51.3 degrees for both samples with data from Sample 3 shown in parenthe- 
ses. 
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of the melt zone as with the third. Both caustics exhibited numerically 
greater angles for Sample 3 than for Sample 4 whose shoulder and heel 
profiles induce greater rotations in the reflected rays. Indeed, the aver- 
aged experimental results for these caustics from Sample 3 agree rea- 
sonably well with the values computed from the symmetric Sample 3 
profile data and rather poorly with the values computed from the sym- 
metric Sample 4 profile data, Figs. 6a and 6b, respectively. These caustics 
are the only primary ones still visible for 6 < 22 degrees, and together 
with the 1-intercept caustic provide some means of studying extremely 
blunt melt zones. 

It should be realized that, since each of these six caustics under dis- 
cussion involves a refraction from the sample, they all may depend on 
the index of refraction, n, as well as on the surface geometry. In Ref. 1, 
another caustic generated wholly by reflection and dependent only on 
G8 was discussed. That caustic is the externally illuminated equivalent 
of the internal catacaustic that forms at the inflection point and can 
easily be generated for all physically reasonable melt zone profiles. Since 
it does not depend on n, information from this externally illuminated 
catacaustic may be used to separate the 6 dependence of the data ob- 
tained from an appropriate internally illuminated caustic. This could 
provide otherwise unavailable information concerning the index of re- 
fraction. For example, consider a melt zone sample of unknown n ex- 
hibiting a first 2-intercept caustic at an angle 04! somewhere between 
52 and 110 degrees. It may be matched with a computer analysis of the 
present data at a 6 determined from direct measurement of the exter- 
nally illuminated catacaustic angle, 04*". Adjustment of the n value used 
in the computer analysis to yield an equal theoretical 04! value should 
provide a good estimate of the unknown index of refraction. The quality 
of this estimate would depend on how uniform n was across the sample 
and how close the propagation vector of the emerging caustic was to 
tangency with the surface. Obviously, if 3 is such that the caustic emerges 
normal to the surface, it can provide no information about n. 
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Caustic Patterns Associated With 
Melt Zones in Solidified Glass Samples— 
Part Il: Asymmetric Cases 


By J. B. SEERY, T. D. DUDDERAR, and P. G. SIMPKINS 
(Manuscript received March 17, 1978) 


Two types of geometrical asymmetries have been analyzed numeri- 
cally to determine their influence on the externally visible caustic 
patterns from a drawdown zone. The first of these asymmetries involved 
axial displacement of identical opposing profiles of the drawdown zone, 
while the second involved opposing profiles of differing slope positioned 
with their inflection points in axial alignment. The results of these 
studies are presented graphically as an aid to interpreting the asym- 
metries of actual caustic patterns generated by illuminating real 
drawdown zone samples in optical fiber drawing. 


|. INTRODUCTION 


In the present study, we demonstrate how deviations from rotational 
symmetry in the melt zone give rise to asymmetries in the resulting 
caustic pattern. In Ref. 1, rotationally symmetric profiles of melt zones 
were generated by averaging measurements of data taken from solidified 
melt zone samples. These data provided symmetrically idealized data 
sets for the study of the two principal types of caustics as functions of 
the rate of change in the cross section. Changes in the maximum rate of 
change in the cross section were simulated by scaling* the data sets, and 
the analysis was carried out using the algorithms developed earlier in 
Ref. 2. These algorithms describe families of 2- and 3-intercept caustics 
as shown in Fig. 1a, where the scale factor is unity. The incident angle, 
G, for an axial light ray at the inflection point, J, is 52.9 degrees on both 
sides of the cross section. However, in Ref. 2 it was shown that the 
samples were usually asymmetric, resulting in pronounced asymmetric 
caustic patterns, especially when the caustic rays emerged nearly tangent 
to the surface. 


* That is, by scaling the axial and transverse data values differently with the scale factor, 
SF = (axial scale)/(transverse scale). 
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Fig. 1—Drawdown profiles for a melt zone with an average y of 1.323 and a preform 
radius, R, of 0.1217 in. (3.09 mm). The data scale factor, SF, is 1. The rays labeled D rep- 
resent the 3-intercept caustic trajectories and the rays labeled 1 represent the first 
2-intercept caustic trajectories for the following cases: (a) Symmetric drawdown profiles 
with y = 1.323 (8 = 52.9 degrees) on both sides and no shift (A/yR = 0.0, 6y = 0 percent). 
(b) Asymmetric drawdown profiles with y = 1.323 (6 = 52.9 degrees) on both sides and 
an axial shift of A = 0.05-in. (1.27 mm) of the left side ahead of the right side (A/yR = 0.311, 
A/R = +0.41). Here dy = 0 percent. (c) Asymmetric drawdown profiles with y = 1.455 (8 
= gece 5 degrees) on the left side and = 1.1907 = 50.0 degrees) on the right side (average 

= 1.323, by = +10 percent). Here A/yR = 0.0 


Two types of asymmetric drawdown zones were studied, based upon 
the symmetric data set generated from the fourth sample.!? In the 
simplest case, the asymmetry was introduced by axially shifting the 
profile data for one side of the cross section with respect to the other. An 
asymmetric cross section with the caustic rays is shown in Fig. 1b, where 
the relative axial shift between the two profiles is 41 percent of the 
preform radius* and the scale factor is 1.0. The asymmetry produces 
significant counterclockwise rotations in the propagation directions of 
the emerging caustic rays when viewed with the downstream profile as 
shown on the left. 

A second type of asymmetry was produced by scaling the opposite 
sides differently. In this study, the inflection points of the opposite 
(differently scaled) profiles were kept in axial alignment so that the re- 
sulting shift in the caustic ray pattern would be due solely to the dif- 
ference in the slopes, as shown in Fig. 1c. There the scale factors of op- 
posite sides are 0.9 and 1.1. These produce counterclockwise rotations 
of the emerging caustic rays comparable to those shown in Fig. 1b and 
toward the elongated profile as plotted. 

In addition, for both types of asymmetry, the overall scaling was 
changed so that the effects of these asymmetries could be assessed for 
greater or lesser average gradients than those of the original data. These 
scale factors and associated average slopes, y, and incident angle values, 
Ba, are tabulated in Table I. Here y is the average of the slopes of the two 
outer normals at the inflection points, dx/dy;, and B4 = —arc tan y. 

The next two sections present the results from the computer simula- 
tions including diagrams of significant cases for both types of asymme- 


* R = 0.1217 in. or 3.09 mm. 
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Table |—Scale factors and average slopes and angles 


Average Outer Average Incidence 

Scale Factor Normal Slope y Angle 8,4 (degrees) 
1.300 1.720 59.8 
1.200 1.588 57.8 
1.000 1.323 52.9 
0.775 1.025 45.7 
0.700 0.926 42.8 
0.520 0.688 34.5 
0.450 0.595 30.8 
0.340 0.450 24.2 


tries. The reader who requires a less detailed synopsis of these results 
is referred to the discussion portions of these sections and to Section V, 
Summary. 


Il. SHIFT ASYMMETRY 


The effects of axial misalignments in the profiles of the drawdown zone 
were studied by computing the ray paths for examples with the profiles 
of opposite sides shifted various amounts, A. The results of these com- 
putations provide a complete description of the caustic patterns for an 
asymmetric drawdown zone with a range of profile shifts for each y given 
in Table I. 

To compare the varied scalings, an asymmetry parameter, A/yR, was 
established, where R is the preform radius. The asymmetry was varied 
from 0 < |A/yR| < 1.20, which represents a broader range than is ever 
likely to actually occur in fiber drawing. The specific incremental changes 
in asymmetry were chosen to illustrate what occurs as the asymmetry 
increases. 

An extreme example of the effects of this type of asymmetry can be 
seen by comparing Fig. 2a with Fig. 2b, which has the same average slope 
but a large asymmetry. Emergent 2-intercept caustics are labeled nu- 
merically to indicate which of the family they represent; the 3-intercept 
caustics are labeled D. The unlabeled rays represent caustic rays of the 
1-intercept family which emerge at the inflection point for ys less than 
0.939. In Fig. 2b, the 3-intercept caustics have rotated counterclockwise, 
or in a positive direction, toward the leading side. Extra 2-intercept 
caustics (labeled 2’, 2’, 3’) have merged on the leading side, while all 
2-intercept caustics have disappeared from the trailing side. Those 
2-intercept caustics on the +A side have rotated clockwise, in a negative 
direction, away from the leading side. 


2.1 Graphical results 


Figures 3a to 3d are simplified versions of the format used in Figs. 1 
and 2 which show both the internal ray paths and the external caustic 
trajectories. In these simplified diagrams, the internal ray paths have 
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Fig. 2—Drawdown profiles for a melt zone with an average y of 0.688 (SF = 0.52). The 
rays labeled D represent the 3-intercept caustic trajectories, while the rays labeled 1, 2, 
3, etc., represent the first, second, third, etc., 2-intercept caustic trajectories for the fol- 
lowing cases; (a) Symmetric drawdown profiles with y = 0.688 (8 = 34.5 degrees) on both 
sides and no shift, A/yR = 0.0, 6y = 0 percent. (b) Asymmetric drawdown profiles with 
+ = 0.688 (8 = 34.5 degrees) on both sides and a shift, A = 0.075 in. (1.905 mm) of the left 
side ahead of the right side (A/yR = +0.896, A/R = +0.616). Here, 5y = 0 percent. 
(c) Asymmetric drawdown profiles with y = 0.894 (8 = 41.8 degrees) on the left side and 
heen (8 = 25.7 degrees) on the right side (y = 0.688, éy = +30 percent). Here, 
A/yR = 0.0. 


been eliminated, and the drawdown profiles have been reduced to tan- 
gent lines at the inflection points. The profiles, shown as thick lines, meet 
in the symmetric case and progressively move farther apart to depict 
increasing +As. The external caustic trajectories are shown schematically 
as rays radiating from the point A = 0. The 3-intercept caustics are again 
labeled D and, in the unusual case of a second 3-intercept caustic, D2 
is used. Whenever the ray which internally reflects from the inflection 
point, J, is refracted from the opposite side without forming an external 
caustic, it is shown by an unlabeled dashed line. In Figs. 3a to 3d, the 
diagrams are drawn with the leading side as the left profile, consistent 
with Figs. 1b and 2b. We then define a positive, or forward, rotation of 
caustic angles to be counterclockwise, or toward the leading side. 

We begin with gradually tapered profiles that emit little light. As the 
taper is increased, we observe an increase in the amount of light emitted 
and in the complexity of the caustic structure. The first case, with y = 
1.72, is shown in Fig. 3a. Both caustics present initially rotate in a forward 
direction as the asymmetry increases. However, the 3-intercept caustics 
quickly disappear from the trailing side by internal reflection. The 2- 
intercept caustics continue to rotate in a positive direction on the trailing 
side, but on the leading side they reverse direction and rotate in a neg- 
ative direction at an asymmetry A/yR of about 0.40. The situation is 
similar for y = 1.588, not shown. The 3-intercept caustics also rotate 
forward. Since the emergent angle for the symmetric case is smaller this 
time (163.48 degrees as opposed to 172.66 degrees), the caustic does not 
internally reflect as soon. The 2-intercept caustic initially rotates for- 
ward. This time, the direction reversal on the leading side occurs earlier 
at a A/yR of 0.33 and is more pronounced. Reducing ¥ to 1.323 does not 
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Fig. 3—Diagrams of external caustic trajectories for drawdowns of varying asymmetry 
due to axial shifts, A/yR, of varying amounts as follows: (a) For y = 1.720 (6 = 59.8 degrees), 
A/yR = +0.000, +0.118, +0.236, +0.354, +0.472, 40.596. (b) For y = 1.025, (@ = 45.7 de- 
grees), A/yR = £0.000, +0.199, +0.410, 40.596, 0.801. (c) For y = 0.926 (8 = 42.8 degrees); 
A/yR = +£0.000, +0.223, +0.441, 40.664, +0.888. (d) For y = 0.450 (6 = 24.2 degrees); 
A/yR = +£0.000, +0.183, +0.365, +0.548, +0.730, +0.913. Here the sides of the melt zone 
are represented by heavy lines sloped 6 degrees below horizontal and shifted vertically 
in proportion to the axial shift, A. The external caustic trajectories are shown as rays em- 
anating from a common center at their appropriate angles and labeled with D or D2 to 
indicate a first or second 3-intercept caustic and 1, 2, or 3, etc. to indicate a first, second 
or third, etc. 2-intercept caustic. Dashed lines represent emergent J rays which do not form 
caustics. 
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alter the situation significantly, except that both caustics are present 
for the entire range of asymmetry. In all cases, the rotation of all caustics 
on the trailing side has been consistently positive. On the leading side, 
the 2-intercept caustic’s forward rotation reverses at a A/yR of only 0.25. 
At the same time, a reversal in the forward rotation of the 3-intercept 
caustic also becomes apparent at an asymmetry of 0.75. 

Figure 3b shows the case of - = 1.025, where 8 = 45.7 degrees, i.e., very 
close to the 2-intercept caustic’s extinction angle. This caustic emerges 
at a minimum angle, 0, for the symmetric case, and progressively forms 
larger angles as A increases; i.e., it rotates negatively on the leading side 
and positively on the trailing side. As discussed in Part I,! the rays which 
are only incident once on the first side form a catacaustic at the inflection 
point. If the catacaustic ray then refracts from the second side, it is 
usually seen as the first 2-intercept caustic ray. However, the dashed 
lines shown in the diagrams for the smaller asymmetries in Fig. 3b depict 
emergent rays which form internal catacaustics at the inflection points 
but are not seen as external caustics. We call these rays the J rays. Their 
angles are at least half a degree different from the 2-intercept caustic 
angles. Once the asymmetry has increased to a A/yR of 0.80, however, 
these two rays have recombined. The 3-intercept caustic rotates in the 
forward direction over the asymmetry range studied. 

Continuing further, the symmetric case with y = 0.926 has no emer- 
gent 2-intercept caustic (see Fig. 3c). However, once the asymmetry has 
increased to A/yR = 0.441, the original 2-intercept caustic has re- 
emerged and subsequently rotates to larger 6 angles on both sides. Again, 
the (noncaustic) J ray is initially distinct from the 2-intercept caustic 
ray, and they merge as the asymmetry increases. It should be noted that 
the J ray initially emerges on the forward side without forming a caustic 
before the 2-intercept caustic emerges. The 3-intercept caustic continues 
to rotate in a forward direction up to the largest calculated asymmetry, 
at which point it reverses direction on the forward side. 

When is 0.688, there are three 2-intercept caustics for the symmetric 
case caused by multiple-folding in the fan of emerging light. This is the 
case shown earlier in Fig. 2a (A/yR = 0.0) and Fig. 2b (A/yR = 0.896). 
The folding geometry is illustrated in the left and center diagrams in 
Fig. 4a. Here the caustic rays are labeled 1, 2, 3, etc., consistent with the 
present convention, while the bounding rays are labeled L1, L2, L3, etc. 
The lower L numbers represent bounding rays originally propagating 
nearest the surface of the sample, with successive higher numbers pro- 
ceeding inward. The left diagram in Fig. 4a shows that the fan of light 
which lies between bounding rays L3 and L4 is folded to form the three 
2-intercept caustics with the first and third very nearly superimposed. 
As the asymmetry is increased, most of the 2-intercept caustics disappear 
on the trailing side while several additional ones appear on the leading 
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side. All these caustics rotate in a negative direction except for the 
original 2-intercept caustic on the leading side, which remains stationary. 
This unusual circumstance occurs because the J ray does not form an 
external caustic until its point of emergence has moved upstream of the 
melt zone to where the radius is constant. 

In Fig. 2b, we can see that, since the profile on the initial incident side 
does not change, the internal trajectory of the J ray remains the same. 
Consequently, as long as caustic ray 1 refracts from the leading side its 
external angle will be constant, regardless of the magnitude of A. On the 
trailing side, the internal J ray is incident along that portion of the melt 
zone profile for which the radius is changing very rapidly; therefore, al- 
though it emerges for asymmetries of less than 0.3 it does not form an 
external caustic. However, this J ray reappears along with a first 
2-intercept caustic when the asymmetry A/yR reaches 1.2. In addition, 
a new 2-intercept caustic emerges on the leading side. This fourth 2- 
intercept caustic is associated with the closing of the break between the 
L1-L2 fan of light and the L3-L4 fan of light, illustrated in the center 
diagram in Fig. 4a. Consequently, this new caustic is folded downstream 
in the same way as the original first 2-intercept caustic. It originates from 
a coaxial ray initially propagating down the sample at a greater radius 
than the second 2-intercept caustic, but well inside the ray which origi- 
nates the 3-intercept caustic, as shown in Fig. 2b. 

Several other minor 2-intercept caustics appear and are labeled 2’, 2”, 
and 3’ in Fig. 4a center. These lie very close—in terms of both originating 
axial rays and emergent angles—to the second or third 2-intercept 
caustics. Each is associated with another fold in the fan of light.* The 
3-intercept caustics in this case (y = 0.688) initially rotate positively. 
Eventually, on the leading side this caustic reverses its rotation at 
A/yR =~ 0.6 and then extinguishes at an asymmetry of about 0.8. On the 
trailing side, it continues to rotate in a forward direction. The additional 
2-intercept caustics also appear in the case where y = 0.595. Again the 
first 2-intercept caustic, when it emerges, remains stationary. The others 
in this family all rotate in the negative direction. The first, second, and 
third 2-intercept caustics emerge on the leading side for all asymmetries 
and are extinguished when the asymmetry is between 0.83 and 1.1 on 
the trailing side. Minor caustics associated with the second and third 
major caustics were found on the trailing side, but not on the leading side. 
The fourth caustic emerges at greater asymmetry than in the previous 
case of y = 0.688 and again only on the leading side. This case, with y 
= (0.595, is of particular interest because it exhibits for the first time a 
second 3-intercept caustic. The new caustic is associated with a folding 


* It should be noted that, since the algorithm computes the ray trajectories for an equally 
spaced sequence of axial rays, finding these minor perturbations in emergent ray angles 
depends somewhat on mene ray spacing chosen. 
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(b) 

Fig. 4—(a) Schematic diagrams showing the external limiting rays for the 2-intercept 
light paths. The drawdown melt zone profiles are omitted, but the orientation is the same 
as in Figs. 1 and 2. The vertical arrow represents the incoming beam of collimated light 
illuminating the sample which in turn deflects the light into the plotted ray paths in the 
far field. Here, the solid lines represent the caustic trajectories and the broken lines rep- 
resent the bounding rays limited by internal reflection (see Ref. 1). The circumferential 
arrows follow the continuous fan of light from the outermost bounding ray to the innermost 
bounding ray including all caustic rays. The bounding rays are numbered in sequence, 
beginning with L1 as the ray which originates nearest the surface of the sample and in- 
creasing inward L2, L3, etc. The caustic rays themselves are numbered 1, 2, 3, etc. to in- 
dicate that they represent the first, second, or third, etc. of the 2-intercept caustics. The 
asymmetries are as follows: Right—Symmetric, with y = 0.688 (8 = 34.5 degrees) on both 
sides and A/yR = 0.0, 6y = 0 percent. Center—Asymmetric with y = 0.688 (6 = 34.5 de- 
grees) on both sides and a shift A = 0.075 in. (1.905 mm) of the left side ahead of the right 
side (A/yR = £0.896, A/R = £0.616). Here, dy = 0 percent. Left Asymmetric with y = 
0.894 (6 = 41.8 degrees) on the left side and y = 0.482 (8 = 25.7 degrees) on the right side 
(y = 0.688, dy = +30 percent). Here, A = 0.0. (b) Schematic diagram showing external 
limiting rays for the 3-intercept light paths. The same ray identifications apply to the di- 
agram as apply to the diagrams in Fig. 4a except that the caustics are numbered D and 
D2, representing the original and second 3-intercept caustics. The asymmetry is given 
by y = 0.595 (8 = 30.8 degrees) on both sides and a shift, A = 0.08 in. (2.032 mm) of the 
left side ahead of the right side (A/yR = 41.105, A/R = +0.657). Here, éy = 0 percent. 


in the fan of light in the direction opposite that of the original 3-intercept 
caustic. It is formed from a ray initially propagating down the fiber at 
a radius smaller than that ray which forms the first 3-intercept caustic. 
The folding pattern is shown in Fig. 4b, where the new ray is labeled D2. 
The new caustic emerges only for large asymmetry and only on the 
leading side. Meanwhile, the original 3-intercept caustic, as before, ro- 
tates initially in a positive direction but eventually reverses with in- 
creasing asymmetry. 

The last case studied, y = 0.450, is shown in Fig. 3d. Here the first 
three 2-intercept caustics emerge over the whole range of asymmetries. 
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The second and third caustics rotate in a negative direction; the first is 
again stationary. The fourth caustic emerges on the leading side and only 
for the largest asymmetry. The first 3-intercept caustic appears only on 
the trailing side and rotates downstream with increasing asymmetry. 
The second 3-intercept caustic appears alone on the leading side and only 
for larger asymmetries. 


2.2 Discussion 


The diagrams in Fig. 3 illustrate the far-field caustic response to in- 
creasing axial asymmetry A/yR for fixed y. We now present a general 
quantitative description of the caustic behavior due to changing geom- 
etry. Figure 5 shows the propagation angles, 0, of the major caustics as 
functions of A/yR for various y. The values, 6_, for the trailing side are 
plotted to the left and the values, #1, for the leading side are plotted to 
the right of the zero asymmetry line. Figures 5a and 5b show the first 
2-intercept and the 3-intercept caustics, respectively. The behavior of 
the remaining major 2-intercept caustics, the second, third, and fourth, 
are plotted in Fig. 5c. Each curve on these plots represents a fixed outer 
normal slope y over the range of asymmetry tested. Those curves with 
arrowheads at the ends indicate caustics which emerge for higher 
asymmetry but were not calculated. The absence of an arrowhead in- 
dicates that the caustic ceases at or just beyond that point. 

Other measures of caustic behavior are given in Fig. 6, for cases where 
the caustics emerge on both sides of the drawdown. In terms of actual 
three-dimensional drawdown samples, the caustic trajectories form cones 
of light whose total included angle is 6_ + 6. The average sum, Og = '(6_ 
+ 64), is plotted to show the half cone angle as a function of asymmetry. 
The average difference, 6p = 14(6_ — 04), is plotted to show the rotation 
of the cone as a function of asymmetry. Figures 6a and 6c refer to various 
2-intercept caustics, while Fig. 6b refers to the 3-intercept caustics. 

Consider the first 2-intercept caustic shown in Figs. 5a and 6a. In Fig. 
5a, starting with large y, on the leading side 04! decreases, reaches a 
minimum, and slightly increases as A/yR increases. As the profile be- 
comes blunter, the minimum becomes more pronounced and moves 
toward the symmetric case, A/yR —> 0. When vy = 0.926, the drop in 0¥1 
as A/yR — 0 becomes so large that the caustic is extinguished for small 
asymmetries and only the ends of the curve remain. This means that, 
although the first 2-intercept caustic is extinguished by internal re- 
flection in the symmetric sample of y = 0.926, the development of suf- 
ficient asymmetry causes an almost simultaneous reappearance on both 
sides of the sample. Figure 6a also shows that the average angle, 05/1 of 
the first 2-intercept caustic decreases as A/yR — 0. This decrease is 
greatest for profiles with the lowest ys. In other words, the caustic cone 
angle increases as the asymmetry increases, especially in blunt samples. 
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Fig. 5—Plots of the 6 angles of the caustic trajectories for various ys as functions of the 
axial shift, A. Here the values of the angles, 04, of the caustics emergent on the leading side 
are plotted on the right and the values of the caustic angles, 6_, of those emergent on the 
trailing side are plotted on the left. These results are arranged as follows: (a) The first 
2-intercept caustic angles, designated 6¥!, and the J ray propagation angle, 6/, vs A/yR. 
(b) The first and second 3-intercept caustic angles, 0? and 0”2, vs A/yR. (c) The second, 
third, and fourth 2-intercept caustic angles, 042, 043 and 0U4, vs A/yR. 


At the same time, the 04! vs A/yR curves in Fig. 6a show that the first 
2-intercept caustic cone, which is initially symmetric about the drawing 
axis, first rotates toward the leading side, reverses, and then rotates back 
toward the axis as the asymmetry increases. 

The 3-intercept caustic exhibits similar behavior, as shown in Figs. 
5b and 6b, although there are significant differences. Again, there is a 
minimum in the caustic angle vs asymmetry curve that appears on the 
leading profile side of the 0? vs A/yR plot. However, the minimum does 
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Fig. 6—Plots of the averaged sums, Os = 14(6_ + 04), and the averaged differences, #p 
= 1,(6_ — 0), vs the magnitude of the axial shift | A|/yR for various ys. These results are 
arranged as shown in 6a, 6b, and 6c. (a) The first 2-intercept caustic angles, 01, averaged 
sums and differences vs | A|/yR. (Figs. 6b and 6c on following pages.) 


not move toward the symmetry axis as y decreases. Also, except in the 
region near the extinction points, the 6? vs A/yR plots are morphologic- 
ally similar. In Fig. 5b, the magnitude of the depression in the curve still 
grows as y decreases until, by 7 = 0.450, the caustic is extinguished on 
the leading side. Figure 6b shows that the caustic cone half-angles, 02, 
are less influenced by the asymmetry than they were for the 2-intercept 
caustics (Fig. 6a). In addition, the 3-intercept caustic cone rotates more 
rapidly toward the leading side before it reverses direction, as shown by 
the 64 curves in Fig. 6b. The behavior of the second, third, and fourth 
2-intercept caustics is illustrated in Figs. 5c and 6c. These caustics are 
distinct because their directions of rotation are opposite those of the first 
two major caustics. This backward rotation, with increasing A/yR, of 
the second and third 2-intercept caustics parallels the reverse #4 rotation 
with increasing y noted in the earlier “symmetric” study.! At the same 
time, the 6s values in the second and third 2-intercept caustics remain 
almost constant, indicating that the shapes of these caustic cones are 
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Fig. 6(b)—The 3-intercept caustic angles, 62, averaged sums and differences 
vs | A]/yR. 


almost independent of the asymmetry, except near the extinction 
points. 


lil. DIFFERENTIAL SLOPE ASYMMETRY 


The effects of asymmetric profiles of differing slopes were studied by 
changing the scaling of opposite sides of the profile by equal positive and 
negative percentages. These changes produce equal percentage changes 
in the slopes of the outer normals to the profiles, dx/dy. As shown in Part 
I,! the outer normal slope defines a unique incidence angle and is the 
most significant single parameter controlling the formation of the 
caustics. Therefore, we can define asymmetry by a + change in yj, 
1.e., Oy. 

An extreme example of the effects of this type of asymmetry is found 
in the comparison between Figs. 2a and 2c where in the latter 6y = +30 
percent. It can be seen that the external trajectories of the 3-intercept 
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Fig. 6(c)—The second and third 2-intercept caustic angles, 042 and 043, averaged sums 
and differences vs | A]/yR. 


caustics have rotated in the direction of increasing y, i.e., downstream 
on the shortened side and upstream on the lengthened side, which will 
be referred to as a forward (positive) rotation. At the same time, the 
external 2-intercept caustics have disappeared from the side of dimin- 
ished y, while an additional 2-intercept caustic, the fourth, appears on 
the opposite side. Like the 3-intercept caustic rays, the first and third 
2-intercept caustic rays have rotated forward, while the direction of the 
second ray is unchanged. Indeed, only the insignificant 1-intercept 
caustics exhibit a backward or negative rotation. A better understanding 
of this behavior can be obtained only in the context of a range of changing 
asymmetries for the various average ys in Table I, which are discussed 
below. 


3.1 Graphical results 


A graphical description of the effects of this asymmetry covering a 
range of typical average ys is presented in Fig. 7. As in Fig. 3, these dia- 
grams are simplified versions of the earlier formats showing only the 
external caustic trajectories. In most cases, the range of asymmetry, 67, 
is from +0 to +30 percent. These represent a much greater range of 
asymmetry than might be expected to arise with actual samples. The 
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Fig. 7—Diagrams of external caustic trajectories for drawdowns of varying asymmetry 
due to percentage differences in y, dy, of varying amounts as follows: (a) y = 1.720 + 0%, 
+ 5%, + 10%, + 20%, + 30%, + 40%. (b) y = 1.025 + 0%, + 1%, + 10%, +20%, + 30%. 
(c) y = 0.688 + 0%, + 10%, + 20%, + 25%, 4+ 30%, + 35%. Here the sides of the melt zone 
are represented by heavy lines sloped 6 degrees below horizontal. The external caustic 
trajectories are shown as rays emanating from a common center at their appropriate angles 
and labeled D to indicate a 3-intercept caustic and 1, 2, or 3, etc. to indicate a first, second, 
or third, etc. 2-intercept caustic. Dashed lines represent emergent J rays which do not form 
caustics. 
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specific incremental changes in asymmetry shown are not uniform, but 
were chosen to illustrate what occurs as the asymmetry increases. In 
every case, the diagram is drawn with the side of reduced y to the left 
so that a forward (positive) rotation of caustic rays is consistently 
counterclockwise on the diagram. In general, on the decreasing y side, 
6 increases and on the increasing y side, 0 decreases with increasing 
asymmetry. 

The first two cases, for which y = 1.720 (Fig. 7a) and 1.588, are similar; 
the rotation of all the caustic rays is forward and the 2-intercept caustic. 
on the right-hand, or decreasing , side disappears by internal reflection. 
It initially propagates at a greater angle for y = 1.720 than it does for 
y = 1.588, and its rotation due to increasing asymmetry is positive. 
Therefore, the right-hand 2-intercept caustic extinguishes at a 6y of only 
+10 percent for the case of y = 1.720, but must reach a 6y of £14 percent 
before it extinguishes for y = 1.588. Decreasing y to 1.323 reduces 6 for 
all trajectories and permits forward rotations of both caustic rays on both 
sides for asymmetries up to 6y = +80 percent without loss by internal 
reflection. | 

Internal reflection can also cause a caustic ray to disappear when its 
propagation direction swings too far upstream (6 decreasing). This is 
shown in Fig. 7b where the 2-intercept caustic ray on the increased y side 
disappears as soon as the asymmetry begins to develop, because it 
emerges near the extinction region.! Very little asymmetry is then re- 
quired to rotate the left-hand ray below the critical angle. 

The case of y = 0.926 lies within the range for which no 2-intercept 
caustic rays emerge from a symmetric drawdown. However, an asym- 
metry of +24 percent yields an external 2-intercept caustic ray on the 
side of reduced y. Hereafter, the ray rotates in the forward direction, as 
do the 3-intercept rays. It is interesting to note that the J ray initially 
emerges on the reduced y side at +22 percent asymmetry without 
forming an external caustic and becomes the externally visible caustic 
ray at dy = +24 percent. A similar ray, initially incident at the inflection 
point on the reduced y side as a catacaustic ray, emerges even earlier on 
the increased y side, at an asymmetry of dy = £20 percent. However, 
within the range of asymmetries considered, this ray never develops into 
an external caustic. 

Reducing the y to 0.688 creates a far more complicated structure even 
for the symmetric case, for which there are three 2-intercept caustics 
associated with multiple folds in the fan of emerging light. As shown 
earlier in Figs. 2a and 2c and further in Fig. 7c, the rotation is in general 
forward, although the second 2-intercept caustic appears to be relatively 
less responsive. On the decreased y side, the fold in the fan of emerging 
light disappears by éy = +10 percent, leaving the J ray still emergent. 
No first or third 2-intercept caustics are formed. At dy = +25 percent, 
even the J ray has disappeared. At 6y = +30 percent, the second 
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2-intercept caustic is extinguished on the reduced y side and a fourth 
2-intercept appears on the opposite side. A better understanding of this 
new caustic, which is the only one that rotates backward as shown by the 
last two diagrams of Fig. 7c, may be obtained from consideration of the 
left and right diagrams of Fig. 4a. As mentioned above, the left diagram 
in Fig. 4a shows that the fan of light which lies between bounding rays 
L3 and L4 is folded to form the three 2-intercept caustics with the first 
and third very nearly superimposed. The right side of Fig. 4a shows the 
same situation on the increased y side for the asymmetric case repre- 
sented by Fig. 2c with the folds of the first and third 2-intercept caustics 
widely separated. The break in the 2-intercept fan of light shown lying 
between bounding rays L2 and L3 in the left diagram in Fig. 4a (which 
separates the initial fan L1-L2 from the folded fan L3-L4) is closed in 
the right diagram by the new or fourth 2-intercept caustic. This new 
caustic is folded downstream in the same way as the original first 
2-intercept caustic and originates from a coaxial ray initially propagating 
axially down the sample at an even greater radius than the second 
2-intercept caustic but well inside the ray which originates the 3-inter- 
cept caustic (see Fig. 2c). 

Ata further reduction of scale to a y of 0.595, all rays rotate forward 
due to increasing asymmetry, with the first 2-intercept the fastest and 
the second 2-intercept the slowest. In this case, the former disappears 
for the increased y side at an asymmetry of almost +30 percent, while 
the fold in the fan of light separating the first and third 2-intercept 
caustic rays on the opposite side is almost completely suppressed by an 
asymmetry of only +20 percent. 

A final case, at a y of 0.450, behaves in much the same way, although 
since the 6 angles are even smaller, the first 2-intercept caustic disappears 
from the increased y side of the drawdown zone at an asymmetry of only 
+4 percent. In this case the 3-intercept caustic propagates at a lesser 
angle than the first 2-intercept caustic and disappears at around the 
same percent asymmetry and on the same side. 


3.2 Discussion 


Study of the collective response yields two general observations. First, 
it is apparent that all of the significant caustic rays rotate in a forward 
direction with increasing asymmetry: the first 2-intercept caustic being 
most responsive, the 3-intercept caustic slightly less so, and the third 
and second 2-intercept caustics even less so, in that order. Second, the 
angles between the opposing rays for each caustic appear to be fairly 
constant with changing asymmetry, except when one of the rays nears 
extinction by internal reflection. 

Figures 8a, 8b, and 8c illustrate these points. Each of these figures is 
a plot of the propagation angle 6 as a function of the percentage of y 
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Fig. 8—Plots of the 6 angles of the caustic trajectories for various ys as functions of the 
asymmetry, dy. Here the values of the angles, 6_, emergent on the side where y is decreased 
are plotted on the left; the values of the angles, 04, emergent on the side where y is increased 
are plotted on the right. These results are arranged as follows: (a) The first 2-intercept 
caustic angles, designated 0/1, and the J ray propagation angle, 0/, vs the asymmetry, dy, 
in percent. (b) The 3-intercept caustic angle, 0”, vs the asymmetry, 67, in percent. (c) The 
second and third 2-intercept caustic angles, 0¥2 and 642, vs the asymmetry, 67, in per- 
cent. 


asymmetry, dy. The values for the side where y is decreased, 0_, are 
plotted to the left; and the values for the side where y is increased, 0+, 
are plotted to the right of the zero asymmetry line. Figures 8a and 8b 
represent the response of the first 2-intercept and the 3-intercept caustics 
respectively, while the responses of both the second and third 2-intercept 
caustics are presented in Fig. 8c. The 6 response of the J ray is repre- 
sented by a broken curve where it appears alone on Fig. 8a, and broken 
curves are used in Fig. 8c to differentiate between the second and third 
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2-intercept caustics. Each curve shown is labeled to indicate the average 
y for which the range of asymmetries is now determined. Thus, to de- 
termine the @ angles for a first 2-intercept caustic ray on a melt zone of 
average y of 1.323 with an asymmetry of dy = +30 percent, one finds a 
value of 115 degrees on the left side of the plot as the value of 0! on the 
side of the melt zone where y is reduced 30 percent, and a value of 77.4 
degrees on the right of the plot as the value of 0! on the side of the melt 
zone where y is increased 30 percent. 

Except for those portions of the curves which are near points of ex- 
tinction, the curves* for both the first 2-intercept caustic and the 3- 
intercept caustic are similar in shape for all ys. In fact, over the range 
of most realistic asymmetries, which means the first +10 or +15 percent, 
these curves are almost linear. This indicates that for these caustics the 
total included angle between opposing caustic rays, 6, + 0_, remains 
constant while the angle of its bisector increases linearly with the in- 
creasing asymmetry. In terms of actual drawdown samples, these caustic 
rays form cones of light which remain fairly stable in shape but which 
tilt in the direction of these asymmetries and become distorted in those 
regions where they approach extinction by internal reflection. This is 
also true of the second 2-intercept caustic except that it seems to be 
relatively less sensitive to this asymmetry. 

Another way of illustrating this behavior is shown in Fig. 9. Here the 
average sum, 05 = 44(0_ + 64), is a direct measure of the constancy of 
the cone half-angle as a function of asymmetry, while the average dif- 
ference, 0p = 44(0_ — 0+), gives a direct measure of the tilt or rotation 
of the cone as a function of asymmetry. Figure 9a shows that the cone 
half-angle for the first 2-intercept caustic, 04¥!, remains fairly constant 
but does decrease a few degrees at high asymmetries. This may be par- 
tially accounted for by the fact that the average 8 angle does not remain 
constant with increasing asymmetry but decreases as shown in Fig. 10. 
Consequently, if the first 2-intercept caustic is somewhat more 
6-dependent, it too may be expected to decrease, since in Part I! on 
symmetric drawdowns it was shown that decreasing @ decreases 0. On 
the other hand, Fig. 9b shows fairly constant 3-intercept caustic-ray cone 
angles, 02, for all circumstances except those approaching extinction, 
as does Fig. 9c for the #5 angles of the second and third 2-intercept 
caustics. 

In all plots of Fig. 9, the curves for the average differences, 8p, vs 


* Many more calculations were made to produce these plots than were illustrated by the 
diagrams of Fig. 7, with particular attention given to accurate determination of the points 
where each caustic disappeared, if it did so within the range of interest. These terminations 
are shown exactly where they occur on the plots of Fig. 8, with arrows marking a contin- 
uation where the computation was stopped for other reasons, such as reaching the limits 
of the maximum range of interest or reaching the limits of the computable range of the 
input data. 
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Fig. 9—Plots of the averaged sums, 05 = 1(0_ + 6+), and the averaged differences, Op 
= 1,(0_ — 04), vs the asymmetry, dy, for various ys. These results are arranged as shown 
in Ya, 9b, and 9c. (a) The first 2-intercept caustic angle’s averaged sums and differences 
vs the asymmetry, +6y, in percent. (Figs. 9b and 9c on following pages.) 


asymmetry, oy, are nearly linear within the same limitation and indicate 
a constant slope or rate of rotation, rp. The greater slopes, rp, of the 
average difference curves in Fig. 9a indicate the higher response of the 
first 2-intercept caustic to changes in asymmetry, which is about 0.8 
degree/+1 percent. Also, their closer grouping indicates a significant 
degree of uniformity of this rate over a range of ys. The 0p vs 6y curves 
for the 3-intercept caustic ray angles are also nearly linear but shallower, 
with rates of rotation, rp, of about 0.6 degree/+1 percent. They also show 
considerably more variation from y to y. 

Figure 9c shows the same two functions (8s vs 6y and 8p vs oy) for the 
second and third 2-intercept caustics. For both, the half-cone angles, 
6s, are reasonably constant except near extinction (i.e., the second caustic 
at y = 0.68 degree), but the rates of rotation are smaller. These run from 
0.4 degree/+1 percent and less for the third caustic to 0.17 degree/+1 
percent and slightly negative for the second caustic, and with much 
greater variation from y to y. 

The results provide a means by which the caustic angles measured on 
opposite sides of a section through the melt zone may be used to estimate 
both average y and its asymmetry for that section. For example, since 
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Fig. 9(b)—The 3-intercept caustic angle’s averaged sums and differences vs the asym- 
metry, + dy, in percent. 


the averaged sum of the 3-intercept caustic angles, 08, is relatively in- 
sensitive to the asymmetry in vy (Fig. 9b), this parameter may be used 
to estimate y. In other words, 68 will be very nearly the same as 0” for 
the symmetric case. In turn, the dependence on y may be derived directly 
from the 6” vs 6 relationship given in Fig. 6 of Part I. Figure 11 shows 
a plot of 68 vs y adapted from this original plot of 6? vs 6 using y = tan 
Band 62 ~ 0. The figure also shows a plot of the inverse rate of rotation, 
rp! vs y. Therefore, for a drawdown where melt zone asymmetry is the 
result of an asymmetry in y, measurement of 6? and 62 will provide a 
means of estimating + and its asymmetry as follows: 

From 6? and 6?, compute 03 = (0? — 6?) and 08 = (02 + 62). 
Then use Fig. 11 to estimate a value of y and to find the corresponding 
value of rp!. The latter may then be used to compute the asymmetry, 
dy, by multiplying by 62. Consider the example of a sample whose 
3-intercept caustic angles measure 134 and 115 degrees. These yield a 
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+5y IN PERCENT 


Fig. 9(c)—The second and third 2-intercept caustic angles’ averaged sums and differ- 
ences vs the asymmetry, + dy, in percent. 


92 = 124.5 degrees which, by Fig. 11, gives an estimated y of 0.930. At 
this y, Fig. 11 also gives an rp! = 1.56. Since 02 = 9.5 degrees, the 
asymmetry, dy, is then 1.56 X.9.5 = 14.8 percent. Computer-generated 
data for a y of 0.926 with an asymmetry of +15 percent gives 0? = 134.1 
degrees and 0? = 115.0 degrees, which is within an absolute error of only 
0.004 in estimating y and 0.2 percent in estimating the asymmetry. This 
order of accuracy is consistent throughout the region for which 02 is 
independent of y. 

The development of additional data for the various 2-intercept 
caustics would permit similar analyses of data taken over the appropriate 
y or 8 ranges for which these caustics appear on both sides of the melt 
zone and manifest linear dependences on 6y or 68. 


IV. COMPARISON WITH AN ACTUAL DRAWDOWN SAMPLE 


Empirical profile data from Sample 1 discussed in Ref. 2 provide a 
comparison for the present technique. Figure 12 shows data taken from 
the 3 and 9 o’clock profiles superimposed on smooth curves computed 
from the averaged data used in the present report (from Sample 4). 
These synthesized profiles have been asymmetrically scaled (y = 1.006 
+ 15 percent) to match the 8 angles of the empirical data (64+ = 48.9 
degrees (3-side) and G_ = 44.6 degrees (9-side)). It can be seen that the 
empirical and computed values are very similar and superimposed (in 
this case) with no axial shift asymmetry (A/yR = 0). Various caustic 
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1.588, 0.926 — 
1,323, 1.025 ~ 





+5y IN PERCENT 


Fig. 10—Plots of the averaged sums, Bs = 1(B- + 84), and the averaged differences, 
Bp = '(B_ — B+), vs the asymmetry, 67, in percent, for various ys. Here G_ is the incident 
angle on the side of reduced y and 64 is the incident angle on the side of increased y. 


62 IN DEGREES 





0.4 0.6 0.8 1.0 1.2 1.4 1.6 1.8 
Y 


Fig. 11—Plots of the averaged sum of the caustic angles, 02, and the inverse rate of 
rotation (from their averaged difference curves), rp', vs y for the 3-intercept caustics. 


angles were computed from these profiles and are shown in Table II along 
with the angles measured in the original experiment and the values 
calculated at that time using the empirical profile data. In some cases, 
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——— AVERAGED SCALED DATA 
7 = 1,066 + 15%, A/y R = 0.0 


© OO EMPIRICAL DATA FROM 
SAMPLE 1(2) 


r/Ro 


STATION 3 
O 





Fig. 12—Plots of the empirical profile data from Sample 1 (Ref. 2) and the computed 
profiles obtained using the present technique for synthesizing asymmetric profiles (y = 
1.006 + 15 percent, A/yR = 0) with differing slopes (84 = 48.9 degrees, 8_ = 44.6 degrees). 
For Sample 1, Ro = 3.24 mm, and for Sample 4, Ro = 3.09 mm. 


Table ll—Caustic angles from profile data 


Measured Values From Calculated Values Using © Computed Values Using 
Original Experiments Empirical Profile Data Smoothed Avg’d Data 


(Ref. 2) (degrees) (Sample 1) (degrees) (Sample 4) (degrees) 

gu 46.1 54.5 EXT (43.2)* 

9¥ 12.5 75.3 75.0 

6? 130.6 133.8 129:1 

g? 140.0 142.7 145.4 

of 59.3 64.9 (59.1)* 

62 135.3 138.3 133.8 

ay 13.2 10.4 (15.9)* 

0p 4.7 4.5 17 


* In this case the caustic is extinguished by internal reflection, so the extinction angle itself 
is used in computing effective 0¥ and 04 values. 


the newer values are in better agreement with the observed angles than 
are those computed using the actual profiles, and certainly they appear 
to be about as good overall. 


Vv. SUMMARY 


Two types of asymmetry have been studied using computer simulation 
for predicting the trajectories of caustics emitted by a coaxially illumi- 
nated, fused-silica melt zone. The first type of asymmetry involves an 
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axial shift of one side of the melt zone with respect to the other. Increases 
in this type of asymmetry produce changes in the trajectories of the 
various types of caustics which appear. The appearance of these caustics 
also depends very strongly upon the average taper of the melt zone.” The 
following effects have been observed: 
(i) The first 2- and 3-intercept caustics rotate downstream when 
they appear on the trailing side of the melt zone. 
(it) The first 2-intercept caustic initially starts to rotate upstream 
and then reverses direction when it appears on the leading side. 
This effect is more pronounced in blunter melt zones. 

(iit) The first 3-intercept caustic generally rotates upstream on the 
leading side but may reverse its rotation for large asymmetries 
on very blunt melt zones. 

(iv) The second 3-intercept caustic appears on the leading side only 

for blunt melt zones (y = 0.450) and only at large asymmetries 
for which it rotates downstream. 
The second and third 2-intercept caustics rotate downstream 
on the leading side and upstream on the trailing side with in- 
creasing asymmetry. This is the reverse rotation of the first two 
major caustics. . 

(vi) The fourth 2-intercept caustic appears only on the leading side 
of melt zones which are moderately blunt (y < 0.688) and ro- 
tates downstream with increasing asymmetry. 

(vit) When a particular caustic appears on opposite sides of a melt 
zone, its intersecting trajectories form an angle which may be 
thought of as the included angle of acone of light emitted froma 
three-dimensional sample.* Cone angles of the first 3-inter- 
cept caustic and the second and third 2-intercept caustics are 
moderately insensitive to the asymmetry. 

(vit) The cone angle of the first 2-intercept caustic increases signif- 
icantly with increasing asymmetry. 

(ix) The cones of light for most of these caustics rotate forward with 
increasing asymmetry. The rotation of the cones of light for the 
3-intercept caustic and the first 2-intercept caustic is initially 
forward or toward the leading side and then reverses. 

(x) The light cones for the second and third 2-intercept caustics 
generally rotate backward at a constant rate with increasing 
asymmetry for a given y. 

(xt) The rate of rotation of the first 3-intercept caustic is generally 
greater than that of the first 2-intercept caustic, while the rates 
of rotation for the second and third 2-intercept caustics are al- 
ways negative. 


(v 


— 


* It should be kept in mind that the cone angle, as we have defined it here, is the sum of 
the particular caustic angles on opposite sides of the drawdown measured from the up- 
stream axial direction. 
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-‘The second type of asymmetry investigated involves scaling opposite 
sides of the melt zone differently to produce asymmetries in slope. In- 
creases in slope asymmetry produce changes in the external caustic 
trajectories, which are in many ways simpler than those observed with 
the axial shift asymmetry. These changes are as follows. 

(i) All caustics generally rotate downstream on the trailing (de- 
creased) y side and upstream on the leading (increased) y side. 
This is comparable to the behavior of the first 2- and 3-intercept 
caustic response to the shift asymmetry. 

(ut) The cone angles of the 3-intercept caustic and the second and 
third 2-intercept caustics are almost independent of the asym- 
metry over significant ranges. 

(zit) The cone angle of the first 2-intercept caustic is only weakly 
dependent on the asymmetry in y and may be varying in response 
to changes in the average value of 6, which itself changes with 
asymmetry. 

(tv) There is no second 3-intercept caustic, as was the case previ- 
ously. . 

(v) All the caustics rotate forward at constant rates for any partic- 
ular y. 

(vi) The rates of rotation are ordered as follows: 


ru1>lrp >1ru3> rue. 


For both types of asymmetry, sufficient rotation in any axial plane 
usually causes one of the caustic trajectories to be extinguished by in- 
ternal reflection. The approach of a caustic trajectory to its extinction 
usually results in an acceleration in its rate of rotation, thereby distorting 
the general pattern of movement of the cone of light for that caustic. 
Nevertheless, over much of the range of reasonable geometries there will 
be caustics whose rotations are sufficiently regular as to provide a direct 
means of estimating the geometrical asymmetry from the asymmetry 
of the caustic cone itself, if the caustic and type of asymmetry are 
properly identified. 
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Measurement of Echoes Due to Spurious TE, 
Modes in a Long-Distance 60-mm Waveguide 
Communication System 


By J. L. DOANE 
(Manuscript received May 19, 1978) 


To estimate the seriousness of spurious TEon Mode generation in a 
long-distance millimeter waveguide communication system requires 
a measurement of small echoes in the impulse response with delays up 
to about 500 ns. We report the results of such a measurement made as 
part of a recent 14-km field test of the WT4 communication system with 
a novel test set operating near 41 GHz. The results show that the WT4 
system can perform satisfactorily without any filters of spurious TEon 
modes. 


|. INTRODUCTION 


The performance of a long-distance millimeter waveguide commu- 
nication system can be seriously degraded by slowly decaying echo trains 
in the impulse response. Slowly decaying echo trains are caused by 
coupling over long distances of the desired signal mode (TE ) with for- 
ward traveling spurious TEo,, modes. When many echoes in such an echo 
train act in phase, the resulting peak distortion on a received signal can 
be quite large, even with relatively small total echo power. 

A previous paper! described an echo test set (Fig. 1) designed to detect 
such slowly decaying echo trains. This test set transmits a 20-ns pulse 
into a 60-mm waveguide line with a shorting plate at the end, samples 
the signal in 4-ns increments for 1 us near the returning pulse, transforms 
the data to loss and delay versus frequency, and overlaps data taken at 
different center frequencies, f,, to extend the frequency range of the data. 
Measurements were made with this test set as part of a recent 14-km field 
test of the WT4 system? between Netcong and Long Valley, New Jersey. 
These measurements indicated indirectly that the power in slowly 
decaying echo trains generated in the 60-mm waveguide itself was too 
small to degrade system performance.! 

To demonstrate overall system performance, however, we must de- 
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Fig. 1—Block diagram of the echo test set. 


termine the impulse response for the entire WT4 system, including the 
band diplexer trees at each end of the 60-mm waveguide. A band diplexer 
tree? contains several miter elbows and hybrids, each of which converts 
a relatively large amount of energy to and from spurious TEo, modes. 
Energy that is converted to TEo, modes in the long-distance 60-mm 
waveguide medium and then reconverted at the band diplexer tree (or 
vice versa) can cause echoes much larger than those generated in the 
waveguide medium alone. Miter elbows used for sharp waveguide bends 
in an experimental Japanese system caused serious ripples in the mea- 
sured loss versus frequency” and presumably serious echoes in the im- 
pulse response also. 

In this paper, we evaluate the seriousness of spurious TEo,, mode 
generation in the WT4 system from the levels of echoes in the impulse 
response. We explain first, in Section II, how to obtain the impulse re- 
sponse from measurements of loss and delay versus frequency. To es- 
tablish credibility for the results, we consider round-trip loss and delay 
measurements of the field test without the diplexer tree! and show that 
the level of the large echoes in the impluse response agrees with inde- 
pendent data. Since the echo test:set could not conveniently measure 
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through the diplexer tree, we made a discrete TEp, generator to simulate 
the tree. In Section II], we describe theoretical and experimental char- 
acterizations of the mode conversion levels in this TEo, generator and 
compare the level of TEgg with that expected from the diplexer tree. In 
Section IV, we compare the impulse response of Section IT with that 
obtained from round-trip measurements of the field test with the TEo,, 
generator in place. From the data in Sections III and IV, we also estimate 
in Section V the level of TE: generated within the waveguide itself and 
show that this level is consistent with an estimate based on mechanical 
measurements. We conclude, in Section VI, with a discussion of the 
implications of all these measurements on the WT4 system design. 


ll. DETERMINATION OF THE IMPULSE RESPONSE 
2.1 Theory 


The levels of echoes in the impulse response can be determined from 
the waveguide forward transfer function. First we write the total wave- 
guide transfer function J, for the TE9; mode (the desired signal mode) 
as: 


Io (f) = Pf)Ho(f), (1) 


where P(f) is the transfer function for ideal waveguide and contains the 
effects of dispersion and ohmic losses. For a length z of waveguide, this 
transfer function is 


P(f) = e—Toz = e—az@~Jbe, (2) 
Here a(f) and @(f) are the real and imaginary parts, respectively, of the 
propagation constant I’, for the TEo; mode. Reflections and mode con- 
version cause the normalized transfer function H, (f) to differ from unity. 
In the absence of reflections and when there is mode conversion from 
TE, to only one other mode, H, (f) is identical to the function G,[AT(f)] 
used in studies of mode conversion.® "8 There, AI is the differential 
propagation constant 


Aa + jAg = AT =T,—T,y, (3) 


where I’, is the propagation constant of the spurious mode. 

Since we know the ideal transfer function P(f), we consider now only 
the normalized impulse response h(t), which is the Fourier transform 
of H,(f): 


h(t) =f” HoPexpG2aft)af. (4) 


To obtain the normalized transfer function H,(f), we first remove the 
characteristics of ideal waveguide from the loss and delay 7(f) data. We 
integrate the residual delay data to obtain the residual phase versus 
frequency ¢(f) and then find H,(f) by exponentiation of the residual loss 
A(f) and phase: 
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H,(f) = exp(—A(f) + jo(f)). (5) 


To reduce echo sidelobe levels, we multiply the real and imaginary parts 
of H,(f) — 1 by a data window and transform the result using an FFT (fast 
Fourier transform). This procedure produces h(t) minus the delta 
function at zero delay. Finally, we square and sum the real and imaginary 
parts of h(t) to obtain the power in each echo as a function of delay. The 
echo power is corrected for the attenuation introduced by the data 
window. 


2.2 Example 


For illustration, we consider round-trip measurements of the 14-km 
waveguide medium without the TE, generator. The baseline for these 
measurements is the round trip to a closed shutter? at the manhole (see 
Fig. 2). Data over a 350-MHz band near 41 GHz are shown. in Fig. 3a. 
These data are the same as reported previously, except that the delay 
dispersion characteristic of ideal waveguide has been subtracted out. 

Most of the features of the data in Fig. 3a exhibit the expected be- 
havior. The decreasing loss versus frequency is characteristic of the 
ohmic loss in the waveguide wall. The level of the very rapid ripples is 
about at the level of the test set noise (0.01 dB rms).! 

The magnitude of the normalized impulse response shown in Fig. 3b 
reveals a pair of equal magnitude echoes separated from the main im- 
pulse by tp = +10 ns. These echoes correspond to the ripple seen in the 
delay data with period 1/t) = 100 MHz. Since these echoes produce a 
ripple in the delay but not in the loss, they must be 180 degrees out of 
phase.!° 

This pair of echoes is caused by reflections from a pressure window 
located 1.5 m in front of the shorting end cap in the test station (Fig. 2). 
This pressure window is a short length of foam with low relative dielectric 
constant « embedded in the 60-mm waveguide. Some energy is reflected 
from the pressure window without ever reaching the shorting end cap 
and returns to the test set 10 ns before the main pulse. Other energy, after 
being reflected from the end cap, is reflected from the pressure window 
back to the end cap a second time and finally arrives at the test set 
10 ns after the main pulse. 


NETCONG 


MAIN STATION TEST STATION 
TEST 
SET TEgn GENERATOR SHUTTER IN PRESSURE 
SWITCH (WHEN IN PLACE) MANHOLE WINDOW SHORT 
0 2—————————— 333 mm  —i13987.5 13989 








DISTANCE FROM TEST SET IN METERS 
Fig. 2—Distances in the wWT4 field test. 
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Fig. 3—(a) Measured loss and residual delay for the 27.21-km round trip between the 
manhole and the test station. (b) Normalized impulse response corresponding to 
Fig. 3a. 


The magnitude of this echo pair seen in Fig. 3b agrees with other 
measurements. Fault location measurements made at 41.24 GHz with 
the shorting end cap removed also gave 20 logi9|R| = —37 dB for the total 
reflection coefficient R at the location of the pressure window.!! Fur- 
thermore, since the two echoes are 180 degrees out of phase, it is easy to 
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show that the reflection coefficient R, for each end of the window must 
satisfy the relation R, = |R|/2 = 0.007. Then we infer that Ve — 1 ~ 
0.014, or « = 1.08, which is exactly the dielectric constant specified for 
the foam.!2 

The remaining echo power in the impulse response shown in Fig 3b 
is quite small. We conclude that there is negligible power in long echo 
trains caused by mode conversion and reconversion within the 60-mm 
waveguide medium itself. Considerably stronger echoes appear, however, 
when the TEo,, generator is inserted to simulate the diplexer tree. 


lll. TEg, GENERATOR CHARACTERIZATION 


The TEo, generator was designed for insertion on the 60-mm side of 
the circular waveguide taper at the Netcong test set platform (Figs. 2 
and 4). The interior of the TEo,, generator is a 76-mm (3-in.) length of 
reduced diameter waveguide. The inside diameter is 54 mm (2.125 in.) 
over the central 50.8-mm (2 in.) region. To reduce reflections, the di- 
ameter is tapered linearly up to 60 mm (2.362 in.) in 12.7-mm (0.5 in.) 
tapers on each end of the generator. 

Perturbation theory® applied to the above diameter variation predicts 
excitation of spurious modes with the levels shown in Table I. We may 
expect these estimates to be somewhat in error, because the diameter 
change from 60 to 54 mm is a fairly large fraction of the original diameter 
and therefore may violate the assumption inherent in the perturbation 
theory. 

We determined experimentally the levels of spurious TEo,, modes 
excited by the generator from the levels of ripples in the round-trip loss 
from the test set to the shutter in the manhole. We obtained a baseline 
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Fig. 4—Layout of test components at Netcong main station including the TEo,, gener- 
ator. 


Table | — Mode levels relative to TE9; at the TEo, generator 
output (perturbation theory) 
40 GHz 45 GHz 
TE2 ~—13dB -14dB 
TEo3 —16dB —15dB 
TEo4 —24dB —34dB 
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measurement in this case by closing the rectangular waveguide switch 
in the test set (see Figs. 1 and 4). The net round-trip measurement sees 
the effects of two equal discrete mode converters of strength x separated 
by a distance 2@, where @ is 381 m (see Fig. 2). The measured mode 
conversion loss A due to one spurious TEo,, mode is then:® 


A=x2(1 + e7 142124 cos AB22) nepers. (6) 


Here Aa and A@ are the differential attenuation and phase, respectively, 
of the TEo,, mode relative to TE; (see eq. (3)). The first term in eq. (6) 
is the round-trip insertion loss of the generator. Because A varies with 
frequency, the second term in eq. (6) represents a loss ripple. 

The period of the ripple in eq. (6) corresponds to a change in A@ of 
27/2. The period 6f in frequency is: 


df 
6 vi 
tang) op) o 
Equivalently, 
1 dAg 
trp=—= 28, 8 
a Be or dw ) a 
where tp is clearly the round-trip differential group delay of TEo,, relative 
to TE. 


The theoretical parameters for the lowest order TEo,, modes are shown 
in Table II for 41.3 GHz, which is close to the center frequency of our 
measurements. We also list for reference the beat wavelengths 27/AG 
and wall coupling coefficients =. For TLo, modes, these parameters are 
not very sensitive to the thickness or properties of the dielectric liner 
on the wall of the waveguide. 

The data in Fig. 5 show clearly that the TEp, generator excites TEgo, 
TEo3, and TEo4. The loss versus frequency data in Fig. 5a show the 
14-MHz ripple period expected from TEg2 in a 762-m round trip. The 
presence of TEo3 and TEo,4 is seen more easily after Fourier transforma- 
tion of the loss. 

With the help of eq. (6), we determined the mode conversion levels 
plotted in Fig. 5b. Defining A(k) as the N-point discrete Fourier trans- 
form of A(f) in eq. (6), we find that the peak magnitudes in A(R) are: 


Table Il — TEp, parameters at 41.3 GHz in 60-mm waveguide* 
TEo2 TEo3 TEo4 

| Act] 0.26 0.70 1.38 nepers/km 
a 22.73 59.76 113.23 radians/m 
27/AB 0.276 0.105 0.056 m 
z 1216 —1804 2447 me 
d(AB)/dw —0.092 —0.253 —0.514 ns/m 
tr(in i fa 70 193 392 ns 
6f (for 76 14.3 5.2 2.6 MHz 
8.686 (Kale tin 381 m). 0.86 2.32 4.57 dB 


* Assuming a 179-um dielectric liner with e, = 2.28 and tan 6 = 1.25 X 1074. 
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Fig. 5—(a) Measured loss for the 766-m round trip between the test set and the manhole 
with the TEo,, generator in place. (b) Modified Fourier transform of Fig. 5a, showing levels 
of TEon modes generated at the TEo, generator. 


| A(R)| = xo laal2e k = NtrAf, (9) 


where Af is the sampling interval in frequency, and tp is defined in 
eq. (8). For these data, Af = 1 MHz, corresponding to the 1-ys total sam- 
pling interval of the original time domain measurements.! Inverting 
eq. (9) to find the mode conversion levels x, we obtain: 


a, = 20 logig x = 10 logio (Cae) + 8.686|Aal|2, k= NtpAf. (10) 


The ratio of 8.686|Aa|£ to round-trip delay tr is very nearly constant 
for TEo, modes and is within 3 percent of 0.012 dB/ns for the three modes 
and conditions of Table II. Using this fact to evaluate the last term in 
eq. (10), we then plotted eq. (10) in Fig. 5b from the data in Fig. 5a. 

The data in Fig. 5b agree quite well with the theoretical expectations 
for the TEo,, generator listed in Table I. The peaks corresponding to TE92, 
TEo3, and TEo, occur at delays within 1 ns of those predicted in Ta- 
ble II. The mode levels inferred from Fig. 5b are —12 dB, —17 dB, and 
—23 dB relative to TEp; for TEo2, TEo3, and TEo4, respectively. In par- 
ticular, the level of TEo2 generated is roughly equal to the worst possible 
TEo2 mode level that could be generated in the band diplexer tree be- 
tween 40 and 110 GHz.!8 
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IV. LONG-DISTANCE ECHO TRAIN MEASUREMENTS 
WITH THE TE,, GENERATOR 


The presence of the TEp, generator causes the round-trip measure- 
ments of the 14-km field test to differ significantly from the measure- 
ments presented in Section II. In this section, we examine these differ- 
ences, first in the frequency domain and then in the time domain. The 
time domain data provide an estimate of the impulse response of the 
entire waveguide system including the diplexer tree. 

Data from measurements with and without the TEo, generator are 
displayed together in Fig. 6a. For these measurements, the shorting 
switch in the echo test set was used to obtain the baseline. The difference 
in average loss with and without the TEp, generator is due mainly to the 
mode conversion loss for a round trip through the generator. For the 
mode conversion levels observed in Fig. 5 for TEo2, TEo3, and TEo4, this 
loss is theoretically 0.77 dB. The additional discrepancy is consistent 
with the test set’s +1 percent accuracy in absolute loss.! A minor dif- 
ference in average delay is caused by a slight difference in the pressure 
of nitrogen filling the waveguide. [We used only the nominal pressure 
to estimate the effect of the refractive index of nitrogen on the ideal 
transfer function P(f) of eq. (1).] 

The level of the rapidly varying ripples increased substantially after 
insertion of the TEo, generator. An additional slowly varying loss ripple 
has also appeared in Fig. 6a that is not present in the measurement made 
with the baseline in the manhole (Fig. 3a). The origin of these ripples 
is clarified by the time domain data in Fig. 6b, which was obtained from 
Fig. 6a by the method described in Section II. 

An additional echo at a 10-ns delay is evident in Fig. 6b and is due to 
a reflection from the precision attenuator in the test set. The echoes 
leading and lagging the main signal are no longer equal in level, as they 
were when the baseline was taken at the manhole (see Fig. 3b and Section 
II). Correspondingly, a loss ripple with a period of about 100 MHz ap- 
pears in Fig. 6a. 

The large echoes in Fig. 6b with delays of 35 and 96 ns may be due to 
mode conversion at the TEo, generator followed by reconversion near 
the manhole (and vice versa). The differential delays for the 381 meters 
between the generator and the manhole are indeed 35 and 96 ns for TEg2 
and TEo3, respectively (see Table II). 

The power in echoes with long delays generally is substantially greater 
with the TEp, generator in place than without it. For example, the in- 
tegrated total power in the echoes shown in Fig. 6b with delays greater 
than 50 ns, excluding the echo at 96 ns, is 44 dB below the signal. In 
contrast, the power in such echoes (and test set noise) shown in Fig. 3b 
is 57 dB below the signal. These levels of echo power are consistent with 
the expected levels of spurious TEo, modes generated in the waveguide 
itself, as discussed next. 
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Fig. 6—(a) Measured loss and residual delay for the 27.98-km round trip between the 
test set and the test station. Measurements with and without the TEo,, generator in place 
are shown. (b) Normalized impulse response corresponding to the data in Fig. 6a for the 
TEpon generator in place. 
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V. COUPLING TO TE), MODES IN THE 
BURIED 60-mm WAVEGUIDE MEDIUM 


5.1 Levels estimated from electrical measurements 


From the measurement data presented in Sections III and IV, we can 
now estimate the amount of spurious TEp, mode power generated in the 
buried 60-mm waveguide. First, from Section HI (Fig. 5 and Table II), 
there is no evidence that the TEo,, generator excites any modes but those 
in the TEo, family. Therefore, the increase in power in echoes with long 
delay seen in Fig. 6b is due solely to mode conversion to and from TEo,, 
modes. 

For simplicity, let us assume that spurious TEo, energy generated in 
the buried waveguide is predominantly in the TEo2 mode. Let us also 
assume that the level of TEo2 generated per unit length does not vary 
greatly with position along the 14-km route. Because TEg2 has a low 
differential attenuation per unit length | Aa], almost all the TEo2 energy 
reconverted at the TEo,, generator is then in echoes with delays longer 
than, say, 50 ns. 

To calculate the level of TEo2 incident on the TEo,, generator from the 
waveguide, we argue in reverse. If the TEg2 level returning from a round 
trip in the 60-mm waveguide and incident on the TEo,, generator were 
—38 dB, then the reconverted power seen in TE; would be 12 dB lower, 
or —50 dB. However, the total reconverted power also contains a con- 
tribution from energy converted to TE: at the generator at the beginning 
of the round trip and then reconverted along the waveguide. Since the 
coupling is completely reciprocal, the echoes caused by coupling to and 
from TEo2 on the forward and reverse legs of the round trip will add in 
phase. Hence the total reconverted power in echoes would be 6 dB 
greater, or —44 dB, as was measured (excluding the echo at 96 ns and 
those with delays less than 50 ns). 

Coupling to higher order TE, modes probably causes a smaller frac- 
tion of the total echo power than does coupling to TEo2, because those 
modes have a higher differential attenuation and also less coupling at 
the TEo, generator (see Tables I and II). The level of TE92 generated 
through continuous coupling in a one-way trip through the buried 
waveguide is therefore roughly 40 dB below the level of TE9}. 


5.2 Levels estimated from mechanical measurements 


From independent measurements, we can estimate the level of TEo2 
that is generated by various coupling mechanisms. These mechanisms 
include continuous coupling at diameter distortions in helix and 
dielectric-lined waveguide and coupling at localized diameter distortions 
at the waveguide flanges. 

The average TEo2 spurious mode level generated by continuous cou- 
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pling in dielectric-lined waveguide can be calculated from the spectrum 
of waveguide diameter variation with distance.® At the mechanical fre- 
quency AG/2zx appropriate for coupling to TEg2 near 41 GHz, the average 
spectrum measured on individual waveguide sections corresponds to an 
average TEo; mode conversion loss in 14 km of about 5 X 1075 neper 
(0.0004 dB). According to the coupled power equations,!4 the average 
random spurious mode power P builds up to a steady state independent 
of the length L of waveguide, when the total differential attenuation - 
|Aa|L is large. This steady-state level is 


_ 4A) 
[Aal’ 





P (11) 


when (A) is the average TEo; loss per unit distance. Taking | Aa| from 
Table II, we find 10 log;9 P to be about —48 dB for continuous conversion 
in dielectric-lined waveguide. 

Analysis of diameter data for some helix mode filters in the field test 
showed that TEog levels of about —51 dB could be generated in some of 
these filters near 50 GHz./5 If we assume every mode filter generates that 
much TEo and take into account the effect of differential attenuation, 
we find that the steady-state level P after 14 km is about 3 dB higher, 
or P = —48 dB. 

The TEo level generated at the flanges can be very serious, but the 
randomized waveguide length scheme used in the field test reduces this 
level considerably.!© Without randomization of lengths, the steady-state 
TEo2 power after a long distance would be only 16 dB below the power 
in TE. (This level was calculated from measured diameter distortions 
near flanges.) Randomization of the waveguide lengths destroys the 
coherent build-up of spurious mode power and thus keeps the average 
random TEo2 mode power P at asteady state 45 dB below the power in 
TEo1. 

The sum of the calculated TEo2 mode power generated by the above 
three sources of coupling is —42 dB, which is close to the value of 
—40 dB inferred above from the electrical measurements. While the 
levels calculated from the mechanical and electrical measurements are 
only rough estimates, the agreement between them is reasonable. 

Reconversion of TEo2 mode power at strong couplers such as the TEo, 
generator or the diplexer tree is evidently always more serious than 
continual conversion and reconversion along the waveguide alone. The 
calculated reconverted power caused by the above three sources of mode 
coupling without the TEo,, generator in place is less than —80 dB and thus 
is completely invisible in Fig. 3. For the case without randomized lengths, 
the total power reconverted from TEg2 with and without the TEo,, gen- 
erator would be —22 dB and —50 dB, respectively. 
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Vi. CONCLUSIONS 


The power in long echo trains caused by coupling between TEo; and 
spurious TEo, modes is too small to affect the performance of the WT4 
system. In the two- and four-phase WT4 systems, the designed thermal 
noise levels are 22 and 28 dB below the carrier, respectively.!” By com- 
parison, the estimated maximum power in long echo trains in the WT4 
field test is about 44 dB below the signal power after a 28-km round trip 
including the diplexer tree. The system thermal noise will therefore 
certainly swamp out the power in long echo trains. 

Assuming the waveguide lengths are randomized as in the field test, 
the power in long echo trains should be no greater in a 50-km or 60-km 
repeater hop than in the field test. After a certain distance in 60-mm 
waveguide, the average random power in any spurious TEo, mode reaches 
a steady state independent of the length of the waveguide. Echo power 
generated by continuous mode conversion and reconversion in the 
60-mm waveguide alone does increase indefinitely with the length of the 
waveguide. The level of this echo power, however, is generally much 
smaller than the echo power caused by reconversion at the band diplexer 
tree of spurious TEp, mode power generated in the 60-mm wave- 
guide. 

Filters of spurious TEp,, modes are therefore not necessary in the WT4 
system. Such filters are difficult to fabricate without introducing sig- 
nificant TEo; insertion loss, and they also require obstructions within 
the waveguide!® that may interfere with waveguide maintenance. 
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Polarization for 19-GHz Satellite Signals 
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Rain and ice crystals depolarize radio waves along earth-satellite 
propagation paths. The magnitude of this depolarization is a function 
of incident polarization angle and ts minimized when polarization and 
depolarizer symmetry axes coincide. A technique is presented for a 
direct determination of the medium’s attenuation and depolarization 
for any incident polarization, based on measurements taken at two 
orthogonal polarizations. Some sample results from this technique are 
presented, using data collected at Crawford Hill, New Jersey using the 
19-GHz COMSTAR satellite beacon. 


I. INTRODUCTION 


Depolarization caused by rain and ice crystals is an important factor 
in the design of future satellite communication systems operating at 
frequencies above 10 GHz.!:2 These systems will likely use dual, or- 
thogonal polarizations to increase transmission capacity. Quantitative 
knowledge of depolarization is necessary for determining the pair of 
polarizations that experience the least depolarization, for determining 
whether the isolation between any two polarizations is adequate during 
rain and ice depolarizing conditions, or for guiding the design of circuits 
for canceling crosstalk resulting from depolarization if the isolation is 
not adequate. 

Only a few measurements have been made of depolarization along 
earth-space propagation paths.?—-> These measurements were not nec- 
essarily made at incident polarizations that produce minimum depo- 
larization. Aerodynamic forces acting on vertically falling raindrops are 
expected to orient raindrop symmetry axes vertical and horizontal on 
the average. Thus, rain depolarization is expected to minimize at linear 
horizontal and vertical polarizations. Maximum depolarization is ex- 
pected for linear polarizations oriented 45 degrees to horizontal and for 
circular polarizations.®>§ These expectations are supported by mea- 
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surements for terrestrial propagation paths, but there is no experimental 
evidence available to either confirm or refute them for earth-space paths. 
Depolarization caused by ice crystals with unknown orientations raises 
additional uncertainty in the expected behavior of depolarization along 
earth-space paths.’ 

It is difficult and expensive to instrument satellites to measure de- 
polarization directly at several incident polarizations. A technique in- 
volving direct measurement of elements in the polarization transmission 
matrix and direct calculation of depolarization for other incident po- 
larizations was proposed for the COMSTAR beacon propagation experi- 
ments.®:9 This calculation method has the potential for answering many 
of the questions regarding rain and ice depolarization along earth-space 
propagation paths. This paper presents what are believed to be the first 
experimental results obtained using the technique.!9 A minimum in 
depolarization occurs for vertical and horizontal incident polarizations 
for the attenuation event analyzed. 


ll. THE MEASUREMENTS 
2.1 Equipment and signal parameters 


The transmitting source for the propagation measurements is the 
19-GHz beacon!! on the COMSTAR satellite located at 95° W longitude. 
This beacon output is switched at a 1-kHz rate between two linear or- 
thogonal polarizations. These 19-GHz signals, among others, are received 
at Crawford Hill, New Jersey with a 7-m diameter antenna and precision 
receiving electronics described in Refs. 12 and 13. 

The polarizations of the beacon signals received at Crawford Hill are 
rotated 21 degrees from horizontal and vertical (for simplicity, these 
signals are referred to as H and V). Amplitudes of copolarized, V and H, 
and cross-polarized, XV and XH, signal components are measured for 
the two transmitted polarizations. Phase differences, ¢, referenced to 
the V signal are measured for the H, XV, and XH signals. The receiver 
3-dB predetection bandwidths of 10 Hz in co- and crosspolarized 
channels yield a dynamic range of 60 dB between the clear air copolarized 
signal level and the receiver noise level; crosspolarized signal amplitudes 
are also measured in 1-Hz bandwidths yielding a 70-dB dynamic range. 
Post-detection bandwidths are 1 Hz for all amplitude and phase mea- 
surements. 

Residual signals in the crosspolarized signal channels are produced 
by transmitting and receiving antenna imperfections. These residuals 
are <—35 dB below the copolarized signal levels everywhere within the 
—3 dB beamwidth of the receiving antenna. Residuals are <—40 dB 
everywhere within the beam for the XV channel (XV is the channel for 
transmit vertical and receive horizontal); on-axis residuals are typically 
<—45 dB for XV and —36 to —40 dB for XH. 
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2.2 Calibration 


The crosspolarized signal channels are calibrated with the antenna 
feed in the normal receiving position. The receiver polarization switches 
(see Figs. 1 and 14 of Ref. 13) are temporarily configured to connect the 
H and V copolarized signals through the XV and XH channels respec- 
tively. These known signal levels are then used to establish the gains of 
the crosspolarized signal channels. These channel gains remain within 
+0.1 dB over many months. The H and V signals are within 0.05 dB of 
each other as measured by physically rotating the receiving antenna feed 
assembly 90 degrees. The phase scale for the differential phase, ¢V-H, 
is calibrated by reversing the polarity of one signal input to produce a 
180-degree phase change. 

Determination of the zero on the phase scales for the crosspolarized 
signals, 6V-XV and #V-XH, requires simultaneous insertion of signals 
with known phase relationship into the receiving feeds. Rotating the feed 
assembly inserts projected spatial components of the copolarized signals 
into both feeds. These components have the same phase and will produce 
signals with either 0- or 180-degree phase difference in the crosspolarized 
signal channels; the sense, 0 or 180, depends on the direction of feed 
rotation. Contamination by the residual crosspolarized signals produces 
an uncertainty in this zero of about +1 degree. This zero calibration 
remains within +2 degrees over many months. The phase scale is cali- 
brated by reversing the polarity of one signal input while rotating the 
feed assembly for the zero determination. Note that the phases of the 
clear-air residual crosspolarized signals are not suitable for determining 
phase scale and zero because they are low level and their phases are ex- 
tremely sensitive to the accuracy of the nulling of the residuals with feed 
assembly rotation.’ 


2.3 Baseline removal 


The amplitudes and phases of co- and crosspolarized beacon signals 
exhibit small diurnal variations. These diurnal baseline variations repeat 
very closely from day to day, but change somewhat from season to season. 
The variations result from thermal changes in the satellite beacon cir- 
cuits, waveguides, and antennas as the solar illumination changes. 
Day-to-day receiver contributions to baseline variations are insignificant. 
Because of the repeatability of these baseline variations, they can be 
removed from the experimental data. 

V and H attenuations and differential phase, ¢V-H, are determined 
by subtracting, from values measured during propagation events, the 
values measured at the same clock times on clear days within a few days 
of the events. The differential values of attenuation and phase that are 
critical in the calculation of depolarization for other incident polariza- 
tions are determined within +0.2 dB and +0.5 degree. 
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Baseline removal of residual crosspolarized components is done by 
vectorially subtracting, from the values measured during events, the 
weighted residual components at the same clock times. These vector 
subtractions can be made since both amplitudes and phases are mea- 
sured. Prelaunch measurements of the beacon antennas, pattern range 
measurements on the receiving antenna, and measurements with the 
receiving feed at normal polarization and rotated 90 degrees all suggest 
that the XH residual is dominated by the beacon antenna. Thus, the XH 
residual is weighted by the event attenuation and differential phase as 
though it were all contributed by the beacon. Weighting of the XV re- 
sidual assumes equal contribution by beacon and receiving antennas. 
The vector subtraction of clear-air crosspolarized channel residuals 
suppresses the residuals to —45 to —50 dB below copolarized signal levels 
before and after events and on adjacent clear days. This accuracy de- 
grades with the intensity of the propagation event because of the un- 
certainty in assigning residual contributions to beacon and receiving 
antennas. However, during weak events when depolarization is low and 
residual suppression is needed, the residual is suppressed to —45 to —50 
dB; when the event is stronger and the suppression degrades, the de- 
polarization is strong and the relative errors contributed by the residual 
are less significant anyway. 


iil. POLARIZATION ROTATION TECHNIQUE 


This section introduces the concept of the medium transmission 
matrix for a given set of incident and received polarizations. Receiver 
outputs directly generate the transmission matrix for the incident po- 
larizations; results for other incident polarizations are generated through 
rotations of the measured matrix. 

Let us assume two orthogonal signals, Ey, and E77», incident on the 
medium. The medium outputs in the same reference frame are Erg and 
Epp. The medium is then completely described by four coupling coef- 
ficients, as shown in Fig. 1. Coefficients A and D represent the copolar- 
ized signal attenuations of the medium, while B and C indicate the me- 
dium’s depolarization. Note that these coefficients are, in general, 
complex. The above relation may be written in matrix form: 


Pelies leat m 





Fig. 1—Description of propagation medium by coupling coefficients A, B, C, D (Eva 
; and £p'transmitted, Erg and Epp received). 
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Since the 19-GHz COMSTAR beacon transmits alternately on two or- 
thogonal polarizations, E7, and Er», are alternately 0. Receiver out- 
puts®!8 are thus directly proportional to A and C (for Evy = 0) or B and 
D (Evra = 0). The transmission matrix Ty for the actual incident po- 
larizations is given by these measured values. 


Am | 


(Tal = lo Du 


(2) 


This measured transmission matrix may now be used to generate the 
transmission matrix for any other orthogonal set of incident polariza- 
tions. This operation is done without recourse to path models. Linear, 
circular, or elliptical polarizations are admissible; the following discussion 
considers only linear polarization. As shown in Fig. 2, the transmission 
matrix for the 1~2 frame is desired, based on measurements obtained 
in the M, — Mz, frame. Components in the 1—2 frame are obtained from 
those in the M, — M, frame through the following relation: 


E Ema 
Le. = [Rel lea e 
where 
_|cos@ —siné 
al = | sind cos 0 | oe 


To generate the transmission matrix for incident polarizations along the 
1—2 axes, the 1-2 inputs are rotated into the measurement frame by [Rg], 
passed through the transmission matrix [Ty], and rotated back to the 
1-2 frame by [R,]—}, ie., 


| act | = Weel (Paalteal | 57]. 6) 


The transmission matrix [Ty] for two orthogonal polarizations rotated 
an angle 0 from the measurement frame is thus 


[Tmo] = [Re]! [Tm] [Rol. (6) 
2 1 
* 
\N 
Be \ 6 Ma 
aN 
aN 


Fig. 2—Relation between reference frames of measured (M,z — Mj) and desired (1 — 
2) transmission matrices. 
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IV. OBSERVATIONS 


The accuracy of the polarization rotation technique described here 
is critically dependent on amplitude and phase calibration of all four 
receiver channels.? To verify the end-to-end operation of the receiver 
and rotation software, a depolarizer with known symmetry axes was 
inserted near the receiving antenna focus in clear weather. Data were 
taken at several orientations of the depolarizer with respect to the actual 
received polarizations. For each case, depolarization was computed as 
a function of incident polarization angle relative to the depolarizer 
symmetry axes. All results were identical, with minimum depolarization 
along the depolarizer symmetry axes. This agreement validates both the 
receiver calibration and rotation software. 

Data from a recent rainstorm were analyzed to determine the de- 
pendence of depolarization and attenuation on incident polarization 
angle. Since little depolarization was observed before or after the high- 
attenuation portion of the event, it appeared that rain, rather than ice, 
was the predominant depolarizing agent.7-19 

Some effects of incident polarization angle are shown in Fig. 3 for three 
points in this storm. These points had maximum vertical copolarized 
attenuations of 5.4, 10.4, and 20.3 dB. The lower three curves show the 
effect of vertical depolarization (the ratio of vertical crosspolarized to 
copolarized levels) as a function of polarization rotation from local ver- 
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tical to horizontal. The experimental data, taken at a 21-degree rotation 
angle, appear along the dashed line. All three curves exhibit a sharp null 
within 2 degrees of local vertical, indicating a mean raindrop canting 
angle close to 0 degree.5:610 

The upper three curves indicate the variation in differential attenu- 
ation (the ratio of horizontal to vertical copolarized attenuations) with 
polarization angle. Maximum differential attenuation coincides with 
minimum depolarization, since the raindrops then exhibit their maxi- 
mum oblateness along the two incident polarizations. Differential at- 
tenuation changes sign at 45 degrees from this point, as “vertical” and 
“horizontal” interchange roles. 

The temporal history of a portion of this rainstorm is shown in 
Fig. 4. The upper two curves indicate maximum depolarization (for 
45-degree polarization) and the concurrent copolarized attenuation. A 
maximum depolarization of —11 dB was observed at 30-dB copolarized 
attenuation. Maximum depolarization exceeded —20 dB for all copo- 
larized attenuations exceeding 11 dB. 

The lower curves indicate time variations for polarization angles in 
the vicinity of the nulls shown in Fig. 3. The central curve indicates the 
polarization angle exhibiting minimum depolarization, while the outer 
curves show the —20 and —30 dB depolarization contours around this 
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Fig. 4—Temporal history of portion of April 20, 1978 rainstorm. Upper curves show 
copolarized attenuation and maximum depolarization. Lower curves show angular location 
and extent of depolarization minima. 
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minimum. For this portion of this event, the polarization angle exhibiting 
minimum depolarization remained within 2 degrees of local vertical. 
Depolarization remained below — 30 dB for all polarizations within 
3.5 degrees of local vertical, and below —20 dB within 12 degrees of local 
vertical. 

The relation between attenuation and depolarization appeared to be 
well-behaved for the section shown of this rainstorm. Other regions, 
however, appeared less homogeneous. Figure 5 shows the polarization 
angle dependence of depolarization and differential attenuation for two 
points in the storm separated by 19 minutes. Both points had 10.5-dB 
copolarized attenuation. The earlier point, however, had 2.5-dB lower 
maximum depolarization, almost 50 percent lower differential attenu- 
ation, and more than 50 percent greater maximum differential phase 
shift between copolarized channels. In addition, the depolarization null 
was filled in below —41 dB. All these changes are consistent with lower 
average raindrop ellipticity (or less preferential drop orientation) and 
the presence of a small amount of depolarization due to ice crystals.7:!9 
Very preliminary observations of other events have suggested that de- 
polarization nulls occasionally deviate far from local vertical during 
periods of predominantly ice depolarization. 
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V. CONCLUSIONS 


A method has been presented, based on transmission measurements 
at one set of incident polarizations, for direct calculation of attenuation 
and depolarization of the transmission medium for any incident polar- 
ization. This technique has been applied to 19-GHz data from the 
Crawford Hill COMSTAR beacon propagation receiver. Test results in- 
dicate that the receiver calibration accuracy is sufficient to provide useful 
information on polarization-dependent effects along earth-space 
propagation paths. 

The rainstorm analyzed exhibited a sharp depolarization minimum 
for incident polarizations within 2 degrees of vertical and horizontal. This 
implies that the mean raindrop symmetry axis was nearly vertical. The 
minimum depolarization was sufficient for communication systems 
employing polarization reuse. 

Similar observations of ice-produced depolarization events suggest 
that the depolarization minima for these events occur occasionally at 
polarizations other than vertical and horizontal. This implies ice particle 
alignment by other than gravity. The simultaneous presence of ice and 
rain with differing symmetry axes could destroy the depolarization nulls 
observed with either separately. Such events, if frequent, could havea 
serious impact on dual-polarization communication systems. The 
technique described here will allow further investigation of this and other 
polarization-dependent propagation phenomena on earth-space 
paths. 
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In a conventional PAM data transmission system, the transmitted 
signal is x(t) = 2 a,g(t — nT), where {a,} is a 24-level data sequence, 
and g(t) is a Nyquist pulse (g(0) ¥ 0, g(mT) = 0,m # 0). Ideally, the 
bandwidth of the pulse g(t) and, therefore, the bandwidth of x(t) can 
be made equal to 1/2T = p/2L, where p is the data rate. In practice, 
however, an “excess bandwidth” of at least 10 to 20 percent is re- 
quired. 

Using a class of real sequences called “discrete prolate spheroidal 
sequences,” we show how to construct a modulated signal with band- 
width just slightly in excess of the optimal p/2L (say, by 2 to 4 percent). 
The new signal is similar in many ways to a conventional PAM signal, 
and in particular an ad-hoc receiver structure ts suggested for which 
the resulting error performance is about the same as for a conventional 
PAM system operating in the same environment. 


I. INTRODUCTION 


To fix ideas, consider the following conventional (baseband) PAM 
data-transmission scheme (see, for example, Ref. 1). The data to be 
transmitted is a sequence {a;}2.. The a, are independent identically 
distributed copies of the random variable a, which is uniformly dis- 
tributed on the set {+1, +3,--+,+2/ — 1}. Thus, a takes 2” equally likely 
values, where L = 1, 2,---, is a fixed parameter. The modulated signal 
is 


xo(t)= SX apgolt — kTo), (1) 


where go(-) is a real-valued “Nyquist pulse”—i.e., 
go(0) € 0, 
£o(RTo) = 0, k £0. (2) 
Since (2) implies that x9 (RT) = azg (0), k = 0, +1, +2,---, the data 
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sequence {a;,} can be obtained from xo(-) simply by sampling. We assume 
that the Fourier transform 


Golf= f_ goltvePlat, = <f<e, (3) 


of go(-) has support on the interval [—F'o, +Fo], where Fp < 1/T. Under 
this assumption, the Nyquist condition (2) is known! to be equivalent 
to 


Golf) + Go (f— 7) = Te, OS FS 7 (4) 


0 
(except perhaps in a set of measure zero). Figure 1 is an example of a real 
Go(f) which satisfies (4). An often-used Nyquist pulse Go(f) is the so- 
called raised-cosine pulse. (See Ref. 1, pp. 50-51.) The bandwidth of xo(-), 
which is the same as the bandwidth of go(-), is taken as Fp. The difference 
Fo — 1/2T is called the “excess bandwidth.”’ 

To conserve bandwidth, it is desirable to make Fp as close to 1/2T 
as possible, but typically (Fo — 1/2To)/Fo = 10 to 20 percent in real 
systems. Further reduction in the excess bandwidth is difficult, since 
the very sharp cutoff filter used to generate go(t) with F9 close to 1/2T9 
will introduce either phase distortion or ripples in the amplitude char- 
acteristic. 

In a practical data transmission system for the voice-grade telephone 
channel, a reduction in bandwidth is also desirable, since the channel 
characteristics at the band edges are poor. 

In this paper, we suggest another approach to the signal design 
problem which will allow a further reduction in the excess bandwidth, 
perhaps to as little as 2 to 4 percent. The technique involves a family of 
sequences called “discrete prolate spheroidal sequences” (DPSS) and 
is also intimately tied up with notions concerning the space of square 
summable sequences (l2). Therefore, before presenting our scheme, we 
must digress to review some notions about the space /2 and to introduce 
the DPsSSs. We do this in Sections I and III, respectively. In Section [V 
we discuss our new scheme. 


Gif) 


EXCESS BANDWIDTH 
Tp 90!9) 


| 
| 
To 
3 90(0) | 
| 
| 





1/2T Fo 1/To f 
Fig. 1—Example of real G(f) satisfying eq. (4). 
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At this point, I would like to acknowledge with thanks three of my 
colleagues without whose help this project could never have gotten off 
the ground. D. Slepian introduced me to DPSSs, and with much kindness 
and no small amount of work helped me to get numerical values for these 
sequences and their associated eigenvalues. J. Mazo taught me most of 
what I know about data communication, and J. Salz’s interest and en- 
thusiasm stimulated me to obtain a full understanding of the properties 
of the modulation scheme. 


li. REVIEW OF THE SPACE /2 


The space l» of square-summable, real-valued sequences is the set of 
sequences {a(n)}°__.. (or a(-)) such that 


~ a(n) <e. (5) 
Let a(-), b(-) € lo; then ee product of a(-) and b(-) is 
(a,b) = ¥ a(n)b(n). (6) 
Also, the norm of a(-) is —_ 
llall = (a, a)¥/2, (7) 


We will need the following facts. For a(-), b(-), c(-) € Jo, and any real 
number y, 


(ya,b+c) = y(a,b) + y(a,¢), (8) 
which implies that, for aj € lo, j =1,2,---, 
IZ ah= 5 lajl2+2 5 (a;,ay), (9) 
Further, the Shes Homa is, for a, - E lo, 
[<a, b)| < llall lol. (10) 
For a(-) € la, the (sequence) Fourier transform is defined by 
Ar(f) = > a(n je=tenis, SOc p< @, (11) 


where 7’ > 0 is a fixed parameter. Of course, A7(f) is periodic with period 
1/T, and usually we will be concerned only with its values on the interval 
[—(1/2T), (1/2T)]. The sequence {a(n)} can be recovered from A7(-) by 
the formula 


a(n)=T oe Aplfye2tTrdf, -o<n<o. (12) 
The convolution aes states that if 


c(n)= > a(m)b(n—m) 


m=—-ca 
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(which we denote c = a*b), then 
Cr(f) = Ar(ABrff), ce a (13) 


where A7, Br, and C7 are the transforms of a, b, and c, respectively. 
The Parseval relation is, for a, b € lo, 


+1/2T 
(a,b) =T f. von AMABr Hal, (14) 


where “‘*”’ denotes complex conjugate. Thus, in particular, 
+1/2T 
lal?=(aay=7 f lAr@Paf. (15) 
—1/2T 


We say that a sequence a(-) € ly is bandlimited to [0, F],0 < F < 1/2T, 
if its transform A7(f) = 0, for F < |f| < 1/2T. Thus, a bandlimited a(-) 
can be written 


a(n) = f. " A(peiiTndf (16) 


A sequence a(-) has support on the interval [N,, No], ~-~7 < Ny < No 
< o, if a(n) = 0, for n ¢ [N,, No]. A sequence with support on 
[N1, No], where |Nj|, |No| < ©, cannot be bandlimited to [0, F'] with 
F< 1/2T. 

It is convenient to define the bandlimiting operator on lo, B = Br, 
0 <F <1/2T, by (fora é€ lg) 


Ba = b, (17a) 
where 
b(n) = [Arif Taf, (17) 
In other words, the transform of b(-) is 
cs a ae (16) 
0, =e \f\ Ss oT’ 


A sequence a € lg is bandlimited to [0, F] iff Bra = a. Corresponding 
to the operator Br, we also define the complementary operator #’ = 
By =I — Bp, where I is the identity operator. 
We also define the index-limiting (or time-limiting) operator D = 
Dn (1 SN <~@), by (fora € la) 
Da= b, (18a) 
where 
a(n), 1<xn<N 
b(n) = : ? 18b 
(n) le otherwise. 18D) 
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Thus a € ly has support on [1, N] iff Dya = a. We will need the following 
easily established propositions. 
Proposition 1: Let x(t), -» <t < «, bea real-valued function with 
ordinary Fourier transform (as defined by (3)) X(f), -~7 <f < ~. Let 
the sequence a(-) be defined by a(n) = x(nT). Then, the sequence 
Fourier transform of a(-) is 
1 2 k 

AOR gas 2), <f<e. 

In particular, if x(t) is bandlimited to F Hz, and 1/T > 2F, then 


1 1 
Arf) =X), I/| <r 


Thus the sequence a(-) is bandlimited to [0,F]. 


Proposition 2: Let a(-) € ls, and let g(t) be a real-valued function of 
the continuous variable t. Let 


x(t)= ¥ a(njg(t-nT), —-<t<o. 


n=—a@ 


Then the ordinary Fourier transform of x(t) ts 
X(f) = Ar (Gf), —o <f<o, 


where A7(f) is the sequence Fourier transform of a(-) and G(f) is the 
ordinary Fourier transform of g(t). 


ill. DISCRETE PROLATE SPHEROIDAL SEQUENCES 

Let T, F > 0 (where W = FT < 1/2) and N > 1 be given; let the oper- 
ators B = Br, D = Dn beas defined in Section II. The following theo- 
rem is proved in Appendix A. 
Theorem 3: There exists a set of real sequences {¢;(-)};, called “discrete 
prolate spheroidal sequences” (DPSS), with support on [1, N] and a 
corresponding set of real numbers fA, called “eigenvalues,” with the 
following properties. 


N 
(A) 1 Z2AyZ Agee = An > O, and > Aj = 2FTN. 
j=l 
(B) DBO; = joj, 1 <jJ<N. 
(C) (bj, bp) = dip 
(D) (Boj, Bor) = Aj5jx 
(KE) With 6 > 0, and F, T held fixed, as N > ~, 
1 (number of j such that i 
N 6<Aj<1—6 
(F) With «> 0, and F, T held fixed,as N > &, 


AeFTN(1-.) — L, 


NOFTN(I +6) — 9. 
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(G) (Slepian [Ref. 4, eq. (63)|), with « > 0 and F, T held fixed, as 
N= oo, 


1- NQFTN(1—6) = exp{—-C(e)N + o(N)}, 
where C(e) > 0. 


(H) The $;(-) and )j, 1 Sj < N, depend on F, T only through their 
product W = FT. 


Remarks: 

(i) In the course of giving the proof of Theorem 3, we will show ex- 
plicitly how to find the DPsss {¢;} and the corresponding {A,}. 

(ii) Theorem 3A, F implies that, with N large, about 2F'T'N of the Ajs 
are about 1, and that the remainder (about (1 — 2FT)N) of the );s are 
about 0. Theorem 3G indicates that the convergence as N — is quite 
rapid. Since this fact is crucial to our modulation scheme, we list some 
of the Ajs for FT = ,, and various values of N in Table I. Here 2FTN 
= N/2, so that about half of the \;s are 1 and the remainder are about 


0. 
Table | — {d,} for W = 0.25, for N= 5, 10, 20, 50, 100 
J dj J Aj 
N=5 1 0.9976686 15 0.0000212 
2 0.9244132 16 0.0000008 
3 0.5000000 17 0.0000000 
4 0.0755868 18 0.0000000 
5 0.0023143 19 0.0000000 
20 0.0000000 
N =10 1 0.9999994 
2 0.9999490 N =50 1-21 >0.9997 
3 0.9980787 99 0.998 
4 0.9650286 23 0.985 
5 0.7326630 24 0.914 
6 0.2673371 25 0.680 
7 0.0349714 26 0.320 
8 0.0019213 27 0.086 
9 0.0000510 28 0.015 
10 0.0000005 29 0.002 
30-50 <0.00023 
N = 20 1 0.9999999 
2 0.9999999 N = 100 1-45 >0.9998 
3 0.9999999 46 0.9993 
4 0.9999999 47 0.996 
5 0.9999992 48 0.976 
6 0.9999788 49 0.892 
7 0.9995798 50 0.664 
8 0.9940340 51 0.336 
9 0.94385514 52 0.108 
10 0.7070557 53 0.024 
11 0.2929445 54 0.004 
12 0.0564486 55 0.0007 
13 0.0059659 56-100 <0.0001 
14 0.0004201 
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(iii) Theorem 3C implies that ||¢;||2 = 1, and Theorem 3D implies 
that ||B¢ j |2 =X j. Thus, the fraction of the energy of ¢; within the band 
[0, F'] is \j. Therefore, when ); is close to unity, ¢; is a sequence with 
support on [1, N] with most of its energy in the band [0, F]. Theorem 3 
implies that, with N large, there are about 2FTN orthogonal sequences, 
ie., | GY = 1, 2,---, 2FTN(1 — ¢)), with support on [1, N] which are 
approximately bandlimited to [0, F]. 

(tv) Slepian has made an exceptionally detailed study of DPSSs and 
their properties. Reference 4 contains most of his results, and Ref. 5 
describes a Fortran program for computing the DPSSs and their eigen- 
values. 


IV. HEURISTIC DESCRIPTION OF THE MODULATION SCHEME 


Let the data to be transmitted be as in Section I, the 24-level sequence 
{aj}2.. We break this sequence into blocks of length v, where the kth 
block is ap,41,°°*, &(k+1) ~° <k < ~, and where v is an integer to be 
chosen later. Consider the Oth block a, -+-, a,. Let N > v be another 
integer parameter, and let F, T > 0, with FT < '4, be given. Let 
pj, Aj, 1 Sj SN, be the DPsss and eigenvalues guaranteed by Theorem 
3, with parameters N, F, T. Then define the sequence 

a(n) = > aj¢;(n), —-o<n<o, (19) 
=1 


j 
Observe that a(-), like the ¢;(-), has support on the interval [1, N]. Fur- 
ther, if we take vy = 2FTN(1 — ©), with N sufficiently large so that A, ~ 
1, then from Theorem 3 (see remark iii), a(-) is approximately band- 
limited to [0, F']. 

Now the modulated waveform corresponding to the Oth data block 
is 


ag sk 
xo(t) = DY a(n)g(t — nT), —x <t<o, (20) 
n=l] 
where the pulse g(t) has Fourier transform G(f) which satisfies 
iT <F, 
ana lh Ml ; (21a) 
0 . 
tera 
leQ(ar, FS (21b) 


27 
Thus, we do not specify G(f) in the interval [F,, 1/2T], except by (21b). 
Since G(f) need not have sharp transitions, it is not difficult to imple- 
ment in practice. For the kth data block (-~2 < k < &), ag,44, -°°, 
Q(k+1)», we set 
a(n) = ¥ apajoj(n-Nk), Nk+1<n<N(k+1), (22) 


jJ=1 
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and let the modulated waveform be 


Nk + 1) 
xe(t)= Do a(n)g(t — nT). (23) 
n=Nk+1 
The entire modulated signal is 
x(t) = e2 xp(t)= YO a(n)g(t - nT). (24) 


Since the number of bits in each data block is Lv and each data block 
“occupies” NT seconds, the transmission rate is 


v\ L 
= (—) — bits/s. 25 
p = (x) qbits/ (25) 

We now give an intuitive, though imprecise, explanation of the 
properties of the modulation scheme. Consider xo(t) given by (20). Its 
Fourier transform is, from Proposition 2, 


Xol(f) = Ar(fGa(f), (26a) 


where 


Ar(f) = 3 alnje- ite 


n=1 
= ¥ a bir (f) (26b) 
: 


where ®;7 is the sequence Fourier transform of ¢;(-). In the light of 
remark iii following Theorem 3, the {®;7(f)}/_; and therefore A7(f) are 
approximately zero for |f| € [F, 1/2T] provided v < 2FN(1 — ¢). Since 
G(f) is bounded in this interval and 0 for |f| > 1/27, we see that Xo(f) 
is approximately bandlimited to |f| < F. Further, if we take » = 
2FTN(1 — «), we have from (25) that the transmission rate p is 2FL(1 
— ¢). Thus in our scheme we can transmit 2F(1 — «), 24-level data sym- 
bols per second with bandwidth F. If « = 0, then we would have effec- 
tively constructed a PAM system with no excess bandwidth. Since, in 
practice, « can be made very small, we can in fact come quite close to the 
ideal. 

So far so good. But we still must show that the data symbols {a;}{ can 
be recovered conveniently from xo(t). In fact, we claim that the samples 
xo(nT) ~ a(n), 1 < n < N. The key observation here is that, since 
Ar(f) ~ 0, |f| € [F, 1/2T], then X0(f) is not appreciably changed when 
G(f) is replaced by G;(f) (“I” for “ideal’’) where 


Te Ape ey, 
G = 

MW) F \f| > 1/27. 
The inverse transform of G, is g7(t) = (sin xt/T)/(at/T). Let us therefore 


(27) 
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define x;(t) by replacing g(t) by g;(t) in the definition of xo(t). We ob- 
tain 


a(t) = alndgi(t ~ 27) 


so that 
x(nT) = a(n), l<n<N. 
It follows that 
xo(nT) — a(n) = x0Q(nT) — x,(nT), l1<n<QN. 
Now define the sequence c(-) by 


c(n) = xo(nT)—x;(nT), 1snsNn. 


Since xo(t) — x;(t) is bandlimited to 1/2T Hz, we have from Proposi- 
tion 1 that 


Cr(f) = - [XoN- XL ll Son, 
and from Proposition 2 that 
Cr(f) = = [Xolf) ~ Xi(H) 
= = Ar(MIG) ~ G(f)]. 
From the Parseval relation (15), 


> [xq(nT) — a(n)}?< ell? 


1/2T 
= 2 
Tf ler lat 


1 
ae ee r say PEONIED) SGI 
<4T |Ar(f)|2df = 4|| Beall? ~ 0. 








F<f<1/2T 
The inequality follows from (21b), which implies that 
IG(f) — Gr(f)| < 2T. 


Thus, a(n), 1 < n < N can be recovered from xo(t) simply by sam- 
pling. From (19) and the orthonormality of the {¢;}1, 


aj= 2 alndoj(n), (28) 
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so that the {a;}{ can be recovered from samples x(nT), 1 <n SN. 
Aside from imprecision, the above arguments completely ignored the 
effects of the other data blocks (k + 0) and the effects of channel dis- 
tortion and noise. In the next section, we give a precise definition of the 
modulation scheme and of a proposed receiver, and then state theorems 
that give bounds on the error introduced by linear channel distortion 


and a channel noise. We will also bound the instantaneous power 
x2(t). 


V. PRECISE STATEMENT OF RESULTS AND DISCUSSION 


Let {a;}2.. be, as in Section IV, a sequence of independent, identically 
distributed copies of the 24-valued random variable a, where 


Pr{a = m} = 2-4, m = +1,+38,---,4+(24—-1). (29) 

The sequence {a;} is the data sequence to be transmitted. Let v,N,F,T 
> 0 be parameters such that pv, N are integers, and 

vp<QN, (30a) 

W = FT < 1/2. (30b) 

Partition the data sequence into blocks of length v, such that the kth 

block is 
Qyk+ 1, °° *, Ap(k+1)> 


k= 0,+1, +2,---. Let {¢;(-), Aiea be the quantities (DPSSs and eigen- 
values) whose existence is guaranteed by Theorem 3 with parameters 
N, W. Corresponding to the kth data block, define the sequence a; (-) 
by 


a(n) =D aynsjoj(n — NR), (31a) 
J=1 
and let 


a(-) = oy ap (:). (31b) 


k=—o@ 
Since ¢;(-) has support on [1, N], a,(-) has support on [Nk + 1, 
N(k + 1)]. Finally, the modulated signal is 


x(t)= XS x(t), —e<t<e, (32a) 
k=—o 
where 


xe(t)= Oo a,(n)g(t — nT), 


n=—a 
N(k+1) 


= D> azp(n)g(t -nT), (32b) 
n=Nk+1 
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and the pulse g(t) is the inverse Fourier transform of G(f), which we 
leave unspecified for now. 

A block diagram for the modulator described above is given in Fig. 2. 
The data symbols appear at a rate of vy/NT per second. Box A takes the 
data symbols v at a time and calculates the numbers {a(n)}—producing 
N outputs for every v inputs. Thus, the a(n) appear at a rate of 1/T per 
second. Box B produces x(t) by modulating the amplitude of a pulse 
train with the {a(n)}. Although we will not specify the ratio v/N and the 
pulse g(t) now, it will be useful to informally think of 


v/N = 2FTN(1 — «-) = 2WN(1 — 0), 


and g(t) <> G(f) as in (21). We will allow the possibility of non-physically 
realizable pulses g(t), with the usual understanding that a close ap- 
proximation to g(t) can be obtained with a finite delay (which we shall 


ignore). 
The received signal is taken as 
y(t) = w(t) + z(t), (38a) 
where 
w(t) = ff ” SG) hl =a)dr: (33b) 


and where h,(t) is the impulse response of the channel (H,(f), the 
channel transfer function, is the transform of h,(t)), and z(t) is noise 
with zero mean and power spectral density Nz(f). 

We now turn to the receiver. We will postulate a simple receiver 
structure which, though not optimum, has the virtues of simplicity and 
amenability to analysis. Furthermore it is probably not very far from 
being optimal itself. Refer to Fig. 3. The received waveform y(t) is first 
sampled at ¢ = nT, to produce the sequence {y(nT)}"__... These samples 
are the input to box C, a tapped delay line with 2M + 1 taps. The output 
of box C is the sequence {4(n)} given by 


{aj} {a(n)} Le x(t) = SS) a(n) g(t-n7) 
“fon PA Pe A anive 


Fig. 2—The modulator. 


a(n) = > c(m) y ((n—m)T) 


|m|SM 


y (t) = w(t) +2 (t) 7 y (nT) \ {a} {4;} 
: c SLICER 
1/T 


TAPPED CORRELATOR 
DELAY LINE 


Fig. 3—The receiver. 
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a(n) = e c(m)y((n — m)T), —-x<n<~o, (34) 
where {c(m)}t™ are the tap weight coefficients. As we shall see, the tap 
weights should be chosen so that the sequence 4(-) is the receiver’s best 
guess of the sequence a(-). 

Consider the Oth block of {4(n)}, i.e., (1), ---, G(N). If d(n) = a(n), 
then the Oth block of data symbols a,,--+, a, could be recovered from 
{a(n)} using (28). Our approach will be to use these same formulas to 
obtain an estimate {a;} of the {a;}, i.e., 


N 
ay => G(n)o;(n), Lis7 Sy (35a) 
n=1 
For the remaining blocks (k # 0), we proceed analogously, viz., 
N(k+1) ; 
Ary+j = YD G(n)d;(n — Nk), 1l<j<<v». (35b) 
n=Nk+1 


This is box D. The final step in the demodulation process is a “‘slicer,” 
which examines @;(—» <j < ~) and emits a; where &; equals a value 
of m € {+1, +3,-++, +24 — 1} which minimizes |@; — m|]. 

As in (25), the transmission rate is p = (v/N) - (L/T) bits/s. 

We are now ready to state the properties of the modulation scheme 
in the form of theorems. The proofs of these theorems are given in Sec- 
tion VI. Theorem 4 gives an upper bound on the average power of the 
transmitted signal x(t). Theorem 5 gives an upper bound on the expected 
instantaneous power E x(t), as a function of t. Finally, Theorem 6 gives 
an upper bound on the mean-squared error. We state these results with 
no restrictions on G(f), H.(f), and Nz(f). In the remarks which follow 
_ the statement of the theorems, we will look at some interesting special 
cases. 

We begin with a bit of notation. Denote the variance of the data ran- 
dom variable a, defined in (29), by 


_ L 
e=Eat=Il > m2 = 22 — DO"+ 1) (36) 
|m|<2b-1 3 ; 
m odd 

Also, for v,N,W = FT satisfying (80), let 

1 
Q= QW,N, W)== > -d). (37) 

j= 


Of course, if we set » = 2WN(1 — 6), and let N — ~, with W, «> 0 held 
fixed, then Q — 0. It will be helpful to think of Q as a small quantity. 
Here are the theorems. Although they may seem formidable at first 
glance, please stick with it! In the extensive discussion following the 
theorem statements, you will see that they can be easily applied and yield 
useful information. 

Theorem 4: (average power) 
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byt ar tC 2a 
4 t)dt 
AV NT J, y 


2 


FSS loU-Bl ar gae} ow 


Ai= 


where 


foe) 


G (/ 7 *) | 7 (38b) 





sup, 
Fs|f[<1/2T T? p=—0 
Theorem 5: For -— t, < 2, 








1/2T | @ ky. 2 
E x2(t,) < Fe G ( 7) iQrfty df. 
eA) T J-1/2T pore f T : | 2) 
Theorem 6: (mean-squared error) 
las 
@2-E YS (4; — aj)? = i + é;, (40) 
Vv J=1 


(N stands for noise, and I for intersymbol interference). The noise error 
ex is bounded by 


RT ler? ( SNe (f-Z))ar+4.@ (aia 


where 
Coif) = 3 el(njeniet ls (41b) 
Nz(f) is the power spectral densi of the noise, and 
Ag gop: EES ay (f 7m 7): (41¢) 
Fs|fjs12T T  ra-@ T 


The intersymbol interference error ¢? is bounded by 


d<o3 (— aie f Lerner - 1 af + As@ |, (42a) 


where 
Boh! _k _k 
BI) gee 2 (/ 7) a (f r) ae) 
C'r(f) is given by (41b), and 
A3= sup |[Cr(f)Br(f) —1|2. (42c) 
F<|f|<1/2T 
Remarks: 


(z) The reason that Pay as defined by (38a) is the “average power” 
is that the random function x(t) is cyclostationary with period NT. In 
other words, the shifted sequence X(t) = x(t — RNT) has the same sta- 
tistical properties as x(t) itself (for k = 0, £1, +2,---). Thus, it follows 
that 


lim E a e “x%(¢)de = = ape f- x2(t)dt = Pay. (43) 
T —7/ 


To 
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(ii) When Q is small, the upper bound of (38a) on Pay depends es- 
sentially on the folded power spectrum 


© k 
= o(f-g) WSF 
Furthermore, if G(f) = 0 for |f| < 1/27, then 
k 
LG(F-F)=40, MSF 


so that Theorem 4 becomes 


F : v 
Paves] ( [ord + sae), Uda) 
where 
A,= sup |G(f)|? (44b) 


Fs |f|<1/2T 


(iii) If we assume, as in Remark ii, that G(f) = 0, |f| > 1/2T, then 
Theorem 5 becomes 


ext) <% (" 
. aie =, 
Thus the upper bound on E x?(t) depends on G(f) for |f| S$ 1/2T. 
(tv) Saltzberg’s? bound can be applied (see Appendix B) to our 
problem to show that the distribution for the instantaneous power 
satisfies 


oT 
|G(f) [Pdf (45) 
OT 


2 Q<_ =e E| 

Pr{|x(t)|? > r%} < 2 exp | oF 20) 
The bound of Theorem 5 can be applied here to further overbound this 
probability. 

(v) We now explain the rationale for using the mean-squared error 
e = (1/v) Yi, E(a; — a;)?. First note that with & = E(a@ — a;)?, 
Saltzberg’s bound (see Appendix B) can again be used to show that, if 
the noise is Gaussian, then 


1 
Po = Pra; 4 aj} S 2 exp {- 5a 
2€; 
Since the sequence of random pairs {a;, &;};~_. is cyclostationary with 
period p, the overall error probability is 


lv 22 1 
P,== > Pj S=¥ exp |~ 53]: (46) 
Vv j=1 V j=1 26 
Now let ¢3,.. = max ¢?. Ineq. (46) yields 


1<j<v 
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1 
P. S 2exp Ee a 
Thus it would appear that ¢2,,, rather than ¢? is the appropriate criterion. 
Nevertheless the use of ¢? (which was, of course, chosen for its mathe- 
matical tractability) can be justified by the following argument. 

Let R be the v X v covariance matrix with (7, k)th entry E(a; — a;): 
(@, — az), 1 SJ, Rk S v. Let Mo be av X v orthogonal matrix such that 
the diagonal elements of MjRMoare all identical.* One choice of Mo is 
Mo = M,Mz, where M, is av X v orthogonal matrix which diagonalizes 
R (i.e., M{ RM, is a diagonal matrix), and M2 is a normalized Hadamard 
matrix (i.e., a vy X v orthogonal matrix with entries +1/Vv). A normalized 
Hadamard matrix is known to exist for all »y which are multiples of 4 
up to 200. 

Now modify the communication system as follows. Preceding box A 
in Fig. 2, insert a device which multiplies the data symbols {a;}, taken 
in blocks of v, by the matrix Mo. Then, following box D in Fig. 3, insert 
a device which multiplies the input {a;}, taken in corresponding blocks 
of length v, by Mj! = Mj, Let the output of this device be {a}. It is easy 
to show that (1) the analysis which yields Theorems 4 to 6 is unchanged 
in the modified system and (ii) E(a; — aj)?=@,1<j <». 

We must emphasize that the choice of the matrix Mp depends on R 
which in turn depends on the channel which is usually unknown or 
variable. Although it is undoubtedly possible to find an adaptive pro- 
cedure for finding a good matrix Mo, our conjecture is that, in most real 
situations, Mo can be chosen to be any normalized Hadamard matrix 
with fairly good results. 

(vi) To obtain more insight into our scheme, let us assume that 
Q ~ 0, and G(f) = H.(f) = N(f) = 0, |f| > 1/2T. Then (38), (41), (42) 


become 
oz fF 
Pies f. Gal, (47a) 
ds ~ f. ICr(P|2N (df, (47b) 


fs oT (“) f “[RernGw@He 1) df. (Te) 


We see immediately that our bounds on the important quantities Pay, 
ex, €7 depend on G(f), H.(f), N(f) only for |f| < F and not for F < |f| 
< 1/2T- In particular, we need a channel whose bandwidth is F’. Since 
the transmission rate 

vy Db yp OL y 


Beg OE a at 
PNT N 2FT 2WN 





* Witsenhausen (Ref. 6) has shown that there always exists an Mg with the desired 
property for all »y = 1, 2,--- 
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we have that 





al 9] re as 


Now the ideal bandwidth in a conventional 24-level PAM system with 
rate p is (p/2L). We pointed out in Section I that the required channel 
bandwidth in real systems is usually no less than 10 to 20 percent in ex- 
cess of this. For our system with vy = 2WN(1 — ©), the ratio of the required 
bandwidth given by (48) to p/2L is (1 — e)—!, which can be made very 
small. See the numerical example in remark vii. 

(vii) Roughly speaking, Theorem 6 tells us that there are three sources 
of error. The first is given by the integral in (41a) which is a bound on 
the error introduced by the noise in the band [—F, F']. The second is given 
by the integral in (42a), which is a bound on the error introduced by the 
imperfections of the channel, as compensated by Cr(f), in the band 
[—F, F]. The third source of error is the fact that Q > 0. The first two of 
these sources appear in conventional PAM systems, but the last is unique 
to our system. The following numerical example shows that Q can in fact 
be made small. 

Let W = 2FT = 0.415, N = 80, v = 64. Then Q(v, N, W) = (1/v) diay 
(1 — A;) = 1.01 X 1074, which corresponds to —40.0 dB. The ratio 2WN/v 
= 1.0375, so that the required bandwidth is 3.7 percent in excess of the 
ideal p/2L. 

Continuing with this example, let us say that 1/T = 6 X 103/s, F = W/T 
= 2490 Hz, and L = 2. Then the transmission rate p = (v/N) - (L/T) = 
9.6 kb/s. Note that 1/2T = 3 kHz, so that the system performance is es- 
sentially independent of the channel characteristics or noise in the band 
[2.49 kHz, 3 kHz]. Of course, we are assuming that H,(f) = N(f) = 0, |f| 
> 3 kHz. 

Finally, observe that the receiver-correlator (box D in Fig. 3) must 
perform N - y multiplications every N - T second, or v/T multiplications 
per second. For v = 64, and 1/T = 6 X 103, this works out to one multi- 
plication every 2.6 us. To store the N X vy ¢j(n), 1Sn<sN,15j S», 
to say 10-bit accuracy, we need a ROM with capacity 10-N-v= 
51.2. kb. 

(viit) Continuing with the assumptions made in remark vi, let us fur- 
ther assume that G(f) = T, |f| < F, and H,(f) =1, |f| < F, ie., a perfect 
channel response (in band). Also, let N(f) = No/2, |f| < 1/2T. Then, 
from (47), 


Pay S 2FT o2. 


Further, the upper bound on the total mean-squared error is minimized 
for Cr(f) =1 (i.e., c(0) = 1, c(m) = 0, m ¥ 0). Then 


N CFNo Gg LN 
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If p/N ~ 2FT, the total mean-squared error 


es alae a2. (49) 
Pav 

Note that (NoF) is the noise power in the band [—F, F]. Observe that in 

a conventional PAM system with a perfect Nyquist equivalent channel 

and additive white noise, the mean-squared error is given the right 

member of (49) with F = the Nyquist bandwidth (see Ref. 1, Chap. 

5). 

(ix) Suppose that it is desired to transmit our data using a modulated 
signal x(t) which is bandpass in the band [F, Fs]. Then, using quadra- 
ture amplitude modulation (QAM) in a straightforward manner, we can 
modify the present scheme to achieve a bandpass signal. We will now 
outline the procedure. 

Let 0 < F, < Fe be given. Set F = (Fo — F;)/2, and choose T < 1/2 F. 
With F, T so chosen, form two modulated signals (with independent 
data) according to our (baseband) prescription. Denote these baseband 
signals by x(t), x(t). Their rates are each vL/NT. Then form a 
bandpass signal 


x(t) = x(t) cos 2aF.t + x(t) sin 2xF,t, 


where F, = (F>. + F,)/2. The signal x(t) is essentially bandpass with 
lower frequency 


ee ae (A=) - (2) Si. 


Z 
and upper frequency F, + F = F>. The transmission rate for x(t) is 
2vL vp L vy L 
SS 0 OF) Sa) Ss FP 
= NT Nope (x) w! 2— Fy) 


Thus the required channel bandwidth to pass x(t) is 
p /2WN 
eae) 25 ( p ) 

exactly as in the baseband case (48). It is a fairly simple matter to analyze 
the QAM system and obtain results analogous to Theorems 4 to 6. 

Another way of accomplishing the synthesis of a bandpass signal is 
to use “bandpass” DPSSs instead of the conventional DPSS characterized 
in Section III. 

(x) Combining (31) and (32), we can rewrite the modulated signal 
as 


x(t)= YS alti), 


jJ=H—@ 
where for kv <j < (k + 1), 

Nk) 

a4 y)= DL dj-r(n — Nk)g(t — nT). 


n=Nk+1 
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Note that, for s = 0, +1, £2,---, 


so that there are only v possible shapes for g(t, /). 

We conclude from this that the present system is a kind of PAM, with 
the data {a;} modulating the amplitude pulses {g(t, j)}. 

(xi) Computation of the Tap Weights: Let us again make the as- 
sumptions of remark vi. Then, to minimize ¢’, it is a reasonable strategy 
to choose the coefficients {c(m)}“y so that Cr(f)G(f)H- (f) is as close as 
possible to unity for |f| < F. Of course, we must take care not to enhance 
the noise by making C'7(f) too large. In fact, it is a simple matter to solve 
for the optimal set {c(m)}“y, which minimizes our bound on the total 
mean-squared error (with Q ~ 0) 


- (*r) lf, erin 2 ay 


+ of (ern G(f) Hef) = 
is T 


2 
1 af | (50) 





Let the sequences £0(-), £:(-), £o(-) be the inverse transforms of 


N1/2(f) GAH ATA 





por) eg ames) 
respectively, where 
1, <F 
r= th ll 
PSS 
0, F<Ifls55 


Then the bound of (50) is, from the Parseval relation (15) (‘‘*” indicates 
convolution), 


N 
a [llcxEqll? + o2|lexEy — Eoll?] 


-* y {| 53) c(mm’)go(n — m’) | 


m'=— 
; M 2 
to2[ Sem ela — m’) — ten) ||. 
m=—- 
Differentiating with respect to c(m), -M < m < M, and setting the re- 
sult equal to zero, yields 


M fo] 
y e(m’) |  £o(n — m)éo(n — m’) 
M n=—o 


m'=— 


$2 n-mMEea(n- m’)} = > £&(n—m)éto(n), 


n=—-o0 n=-0 
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or 


s c(m)uo(m — m’) = py(m)}, m=0,+1,---,+M, (51a) 
M 


m=— 


where 


uo(m) = io lEo(n)Eo(n — m) + o2E(n)E\(n —m)] (5b) 


mlm) = X &(n —m)éo(n), (51c) 
Clearly, uo(-) is the inverse transform of 
2 2 
T tT 
and y;(-) is the inverse transform of (02/T) G* (f)Hz(f)I'(f). The tap 
weights {c(m)}“y, are found by solving the linear equations (51a). 

Of course, the above computation of the tap weight coefficients is 
possible only when the channel transfer function H,(f) and the noise 
spectrum N(f) are known. In most real applications, these quantities 
are unknown or changing, so that an adaptive learning technique is re- 
quired. 


Vi. PROOF OF THEOREMS 
Proof ue Theorem 4: Using (32), we have 


Pav* E NT me 
AV “Nr a (t)dt 
= 2 
aN? a xn(t) ) dt 
1 2 
ge eed 
~kNT+ 
~ NT 2 
eas ee xo(t)dt 
nre So xe(t)dt = oak f. | Xo(f)|2df. 


Step ( iy ie from the atte of the {a;}=.., which implies [see 
(31) and (82b)] that EF x; (t)x,-(t) = 0, k # k’. Step (2) follows from (32b), 
and the fact that {a, (n)}“%4}), has the same statistics as {ao(n)}_). Thus, 
x;,(t) has the same statistics as xo(t — RNT). 

We next apply Proposition 2, which implies that 


Xol(f) = Ar PGA), (58a) 


where 


Ar(f) = > a(n)e~t27fPn | (53b) 


n=1 
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Substituting (53a) into (52), we obtain 
1 fore) 
Pay= <a f. |Ar(f)|21G ()|2df 
1 © +1)/T 
= LES, IGMlArMler 


ee S,|¢ (F-5) | arerer, 


where we have used the fact that A7(f) is periodic with period 1/T. 
Thus 





Pav=xa Sf. (|e (*- |) @larniner 


1 +1/2T 2 
NT —1/2T (,2, |¢ ( ) BlarOPaf, 


where the last step follows from the fact that the integrand is periodic 
with period 1/T, so that we can change the interval of integration from 
[0, 1/T] to [— 1/2T, 1/2T]. Continuing, we have 


Pav=xn f (2. |e (f-5)| ) elarolar 


Hp” Sreineuer ( [6 (F~ 5) ) laroren 
=[y4+Io. (54) 


Now the second integral Jz can be overbounded by 











1 1 1 
InS——(A EY Ar(f)|2df = — Ay TE = || B’aoll2, 
Spt desi 


where A, is defined by (38b). From (31), with k = 0, and (B’¢;,B’¢;") 
= (1 — dj)6;;, we have 














1 y ; 
Ip AE % aj; B'o; 
1 vy 
=—A,E ¥ a3(1—d 
N 1 Day ) 
= yt 2Q, (55) 


where Q is defined by (37). 
To overbound J, the first term in (54), we again use (31) to obtain 


E|Ar(f)|2 =E | = aj Pjr(f) 
2 
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where ;7(f) is the transform of ¢;(-). Since E aja; = 025;;, we have 


E|ArA2= X o2fOr(N/2 


< ay 
by Theorem 7 (proved in Appendix C). Thus 


nsE SCE le(-Di)e 


Substituting (55) and (56) into (54) yields Theorem 4. 
Proof of Theorem 5: Let t}, -~ <t, < ©, be given. Then from (32) and 


(31), 
és o N(ktl) » 
x(t) = a xe(ty= > D~ LX apr joj(n — NR)g(t; — nT) 


k=—-© n=Nk+1 j=l 








N(k+1) 
=X Eatees| dail — Nedgtts ~ 07) | 
k J n=Nk+1 
Using E aja; = 026;,, we obtain 


N(k+1) 2 
BxXn)y= Sot] S” oj(n-Negtts—a7)]|. (7) 


ka—@ j= Nk+1 
Now with t, held a define the sequence c(-) by 
c(n) = g(t; — nT), —o<n<o, 
Also for -—» <k < o, 1 <j X »v, define the sequences ¢;;(-) by 
prj (n) = oj(n — RN), —-o <n<~o, 


Thus pj (-) has support in the interval [Nk + 1,N(k + 1)]. Of course, for 
—o <k,k’< oo, 1 <j,j’ S», 
_{L RRR, 
Cbrjs dey’) 0, otherwise, 


so that {¢x;},,; is a family of orthonormal sequences. Furthermore, the 
term in brackets in (57) is (¢,;, c), so that (58) can be written 


E x(t) = ey D > (ppj, €)?. (58) 
=—oa J=1 
Letting § be the subspace of /2 spanned by the {¢;;}, and Psc the pro- 
jection of the sequence c into &, (58) is 


E x(t1) = o2||P cll? s o2llell?. (59) 


We will now bound |lc||2. 
Define the function w(t), -»° <t < ~, by 


w3(t) = g(t; — ¢), 
with ¢, still held fixed. Then 
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c(n) = w,(nT), 
and Proposition 1 yields 
Ls -9@ k 
Cre Wall =) 


where W,(f) is the ordinary Fourier transform of w(t). Since W,(f) = 
G* (f)e—!27/ti we have, from (15), 


lel2=T ne [CrP |2df 


1 1/2T = Gs (4) seein, 2 
k=—o@ T 


T J-1/2T 
Combining this with (59) yields Theorem 5. 

Proof of Theorem 6: We begin by observing that the entire system 
described in Section V (up to the slicer in Fig. 3) is linear and the noise 
is additive and independent of the data. Thus, the error sequence 
\a; — a;}*., can be written as the sum of two sequences {8}. and {v;}".. 
The sequence {;} is data dependent and is of the form 


df. 








By = Li ajay. 
J 


In fact, the sequence {§;} is the output of box D in Fig. 3 when the noise 
z(t) = 0. The sequence {y;} is due to the noise and is, in fact, equal to the 
output of box D in Fig. 3 when we set w(t) = 0. Since the data and noise 
are uncorrelated, so are {@;} and {y,;}. Thus the mean-squared error 


S225 3 (a= 05)? 
eo Gat 


ly 
= 6+ 1? 
1 ae | v 
=e eB, PS yy 
yo j=l po js 


> fl 


= tex. (60) 
We will overbound «7 and ez separately. 

We begin with ¢%, the error due to the noise. Thus we must over- 
bound 


éN = E+ ap v?, 
where {y;} is the Suter sequence ‘r box D in Fig. 3 when w(t) = 0. aA 
us define the sequence bo(-) to be the output of box C when w(t) = 
Then 


N 
vj = (bo, oj) = zs bo(n) oj (nr), 1<j<N 
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and 


N 
u > dj(n)dj(m)E bo(n)bo(m). (61) 


Next observe that bo(-) is a stationary random sequence with 
E bo(n) = 0, E bo(n)bo(m) = Rz,(n — m). The sequence R,,(n), — - < 
n < ©, is the inverse Fourier transform of its spectral density S,,(/), 
which is, from Proposition 1, 


cath = _k 
Srl = pICrMl? Nz ( 2: (62) 
Returning to oa we write 
oe | Ras m)oj(m) | (63) 
VY j=l1n=1 m=1 


The quantity in brackets in (63) is d;(n), where the sequence d;(-) is the 
convolution of the sequences R,,(-) and ¢;(-). Further, (63) can be written 
as 


ey 


RS, 
ite 


N 1N 
te oj(n)dj(n) = . x (o;, dj). 
Rx = 


From the Parseval aa (14), we have 
TN 1/2T 
ha> df Dir vjr(Adf, (64) 


where ®;7(f), Djr(f) are the transforms of ¢;(-), d;(-), respectively. 
Furthermore, the convolution theorem (13) yields 


Djr(f) = Bir P)So(/), 
so that (64) becomes 


v /2T 

2 fon Self 8ir PLA 
* § ; 2\d 
_, Sool) (> 2,1) 


Fo jae SEM URAL: 


Using Theorem 7 (Appendix C), we obtain 


N Ff v 
<= J. (T Sy(f))df + | sup Sil | : X 18’; |? 
v —F Vj=1 


F<|f|<1/2T 
N fF k le 
= 7 J, ler@l? = Nz (F-5) df + Az” & (1-N) 


which is (41). 
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It remains to verify (42), which is an upper bound on the data- 
dependent error or intersymbol interference error 7. Thus, set the noise 
z(t) =0, and 

2 ee A y) 
gq=E-> (aj; — aj)”. (65) 
Vj=1 


We begin by observing that, since the sequences ¢;(-), j = 1,-++,v, are 
orthonormal, any sequence co(-) can be written as 


co) = X vjids6) tre), (66) 

J=1 
where (r, ¢;) = Oand 7; = (co, $j), 1 SJ S v. Applying (66) to the se- 
quence D(a — a), where d(-) and a(-) are as defined in Section V and 
D = Dn is the index-limiting operator defined in Section II, we ob- 


tain 


D(a-a) = = (D(4 a), 6) 4; +r) 
£ 


= = (a —a, $j) oj +r, 
= 
where (r, ¢;) = 0,1 <j Sv. Thus 


|D(@ — a)? = ¥ (4 —a, 6)? + Url? 
z 


2 A OO) % (67) 
j=l 
Now from (381), 
(a, pj) = aj, Lay Sw, 
and from (35a), 
Thus (67) is 
|D(4 — a)*f2 = (6; - a), 
jJ=1 
so that the mean-squared error, 
ae peat ape 1 P 
q@2=-E > (a — aj) <-E|D(4é —- a)? (68) 
v j=l v 
Ineq. (68) relates the error ¢? to the error which the system makes in 


transmitting the sequence a(-). 
We proceed to overbound (1/v)E|| D(a — a)||2. We now define 
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I @ k k 
Hr(f) = CPB) = Fr) © G (f—75) He (f- 5): 

(69) 
Br(f) is defined by (42b). It is easy to verify that H(f) is the (sequence) 
transfer function of the overall system from the input to box B at the 
transmitter (Fig. 2), through the channel [defined by (33)], and through 
the sampler and box C at the receiver (Fig. 3). Thus with h(-), the inverse 
transform of H7(f), 


a(n) = = h(n — m)a(m), -ox<n<o, (70) 


Further, the error sequence 


a(n)-a(n)= > a(m)[h(n — m) — bn m] 


= % a(m)u(n —m), —-x<In<o, (71a) 

where the sequence u(-) is defined by 
u(n) = h(n) — bro, —-o<n<o, (71b) 

The transform of the sequence u(-) is 
Ur(f) = Ar(f) — 1. (72) 


Now, with a; (-) as defined by (81a), we define the convolution of a; (-) 
and u(-) to be 


Up(n) = (apxu)(n) = = a,(m)u(n — m) 


m=— ox 

N(k+1) 
= > ap(m)u(n — m). 
m=Nk+1 


Since a(-) = 3 a;,(-), (71) is 
h==o 


G-a=axu= > vp. (73) 
We next introduce the time-truncation operators D\*), -~ <k < &, 
defined by 
bo(n), Nk+1sn<N(k+1), 
0, otherwise. 
Of course, D = DM). Then, from (73), 


(D)b)(n) = | (74) 


1 : 1 ; 
: |D(é -— a)||2 = : D(a — a)||? 


~ Dv, 


k=—0 


1 
v 











2 1 © 
| =- F  (DOvp_, DOvy). (75) 


V k,k’=—0 
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Now let us observe that v, depends (through a;,) on the data symbols 
in and only in the kth data block 


Apkti,*** 5 Ay(k+1): 


Since all the data symbols are assumed to be statistically independent, 
we conclude that uv; and v;’ (k # k’) are also statistically independent 
and uncorrelated. Thus 


E(DOv,z, DOuvz,+) = 0, RAR’, 
and (68) and oe are 


e< 
qs 





a> 


=—o0 





(76) 


We now make another een about the sequence vu; (-). As we 
observed in the proof of Theorem 4, the N random variables {a;,(n)}N&#), 
have the same statistics as the N random variables {ao(n)}*_. It follows 
that, for —~ <n < ~, v;(n) has the same statistics as Ug(n — Nk), so 


that 


N 
E||\Dv, ||? =E DY vg(n) 
n=1 


N . N(-k+1) 
=E > vi(n-Nk)=E > van) 
n=1 n=—-Nk+1 
= E||D voll. (77) 
Substituting (77) into (76), we have 
LT. = 1 
75- DL ElDKHvoll2 = = Ellvoll?. (77a) 
V p=-—o@ vp 


Now from the convolution formula (13) and the Parseval rela- 
tion (15), 


ae E|lagu|? 
=-TE {" lAr|UrMlar (78) 
Since 
Ar(f) = = aj dir (f) 
and the {a;}{ are uncorrelated, 


BlAr(f)/? = o2lbr(f)2 (79) 


Substituting ee and (78) into «7 we have 
Ged 2 
G<=T a JUr)? & ofl ayrOVPat 


v 
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r ("jure & lend 
J_, Url & eras 


+ 


Game Ae. 2 
y ta Vaan LUA PirOPat 
<= 7N f., |Ur(f)|2df 


v 
+7 sup [Ur(f)|? = |®(A) Pdf. 
Vo F<|f|<127 Fs|f|S1/2T 
Using (72) and (69), which ae U(f), and the definition of A3 (42c), 
we obtain 


doz (— tS |Cr(Br(f) — 1]2df 


+ Ag+ 7 3 lava 


Since ||8’¢;||2 = 1 — \j, we have established (42), completing the proof 
of Theorem 6. 


APPENDIX A 
Proof of Theorem 3 
Let T, F > 0, with W * FT < 1/2 be given. Let N = 1, 2,---, also be 
given. Define the sequence y(-) by 
in 27W 
y(n) =e <n <m, (80) 
ray) 


The transform of y(-) is easily seen to be 


— 


» Wf SF 


Pr(f) = YL y(njeni2fTn = (81) 


n=—-© 


1 
0, F<(|f| < oT 


The bandlimiting operator 4 = #; is therefore defined by b = Ba, where 


Br(f) = TrArff). 
Let K be the NXN matrix with (m, n)th entry y(n — m), 1 <n,mS 


N. Consider the matrix eigenvalue equation 
Ka = da, (82) 


where @ = (a1, a2, +--+, ay)’. Equation (82) is equivalent to 
N 
>~ y(n -— m)an = Aan, lsnQN. (827) 
m=1 
Since K is asymmetric matrix, eq. (82) or (82’) has N (not necessarily 
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distinct) real eigenvalues A; = Ag = --- = Aw and acorresponding set of 
N real orthogonal eigenvectors a; = (aji,--+, ajn)’, 1 SJ SN. Weas- 
sume that the eigenvectors are normalized so that aja; = 6;;. Also we 
can write 


N 
DY y(n — M)ajm = djajn, 1l<n<N. (83) 
m=1 
We can now define our sequences {¢;(-)}. Let 
Ge Laren, 
(n) = 84 
$j (n) ie otherwise. (84) 


Equation (83) is therefore 
N 
DL y(n — m)gj(m) = Ajo (n), Lene, (85) 
n=1 


which is equivalent to 
DBO; = rj oj; 1<j<N, (86) 


where D = Dn, and B = Br. This is Theorem 3B. 
Now, for 1 <7 <N, let c;(-) = Bd;. Then (86) implies that 


cj(n) = dj; (n), l<n<N. (87) 
Thus 


N N 
(ej, 6) = De eln)gjln) = L Ajoj(n) 


= dj ll; ll? = Aj. 
Further, since the transform of c;(-) is C)r(f) = 'r(f/)8;r(f), the Parseval 
relation (14) yields 


1/2T 
dy = (cj, 0)) = T ifn op ET Sir Pin Naf 


F 
= . 2 
T fe |; (f) |2df. (88) 


From (88), A; < ||¢;||2 = 1 and A; = 0. In fact, if \; = 0, then ®;(f) = 0 
for |f| < F. But, since @;(f) is a polynomial in e~!27/7, it vanishes on an 
interval only if it vanishes identically, which contradicts ||¢;|| = 1. Thus 
\; > Ofor1 <j < N. Since D1’ \; = trace K = 2WN, the {d;}" can be la- 
beled so that they satisfy Theorem 3A. 
Now Theorem 3C follows from 
(dj, Pk) = Aap = dyn, 


Theorem 3D is established as follows: 


1/2T 
(Bo;, Box) = (ej, ce) = T f pn iT Cin haf 
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1/2T ; 
=T f op Er) Pir Air Hat 


(1) 
= (Boj, or) = (DBd;, br) 


(2) 
= hj (oj, Oe) = AjOjr 
Step (1) follows from the fact that ¢, has support on [1, N] so that for 
-any a(-), (a, 2) = (Da, oz). Step (2) follows from Theorem 3B. 
To prove Theorem 8F, observe* that 
N 
> Aj = trace K = 2WN (89) 
j=l 
and 


N 
\P=trace(K'K)= DY y2(n-—m). 


nym=1 


iMe=z 


j 
Substitution of the formula for y(n) (80) and a simple computation 
yields 


N 
> A? = 2WN — 0 (log N), (90) 
1 
as N — ~, Combining (89) and (90), we have 
1N TN O(log N) N 
N= Aj(1 — Aj) =e Aj ~ A} Sg 
Let S = {j: 6 < Aj < 1 — 6}. Then (91) yields 
1 . 1 N N 
nv? (card S) < NY Aj(1 — Aj) —> 0, 


0. (91) 


which is, on dividing by 62, Theorem 3E. 

Theorem 3F follows directly from Theorems 3A and 3F. Theorem 3G 
is established in Ref. 4. Finally, Theorem 3H follows immediately from 
the definition of the ¢;(-), Aj, lL <j SN. 


APPENDIX B 
Saltzberg’s Bound 
Saltzberg’s bound? states that if £ is arandom variable defined by 
f= do joj, 
jJ=H-©@ 


where {a;}=.. are i.i.d. copies of the r.v. a, defined by (29), and {n;} are 
fixed coefficients, then the moment generating function of & 


s2o? 
M;,(s) = E e%#! 2 exp eae s=0, (92a) 


* This trick is used in Ref. 7. 
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where 


oj = Var &) = 04 3 n?. (92b) 


fee 6 
The right member of (92a) is the moment-generating function of a 
Gaussian r.v. with zero mean and variance o}. 

Now let & = & + &, where £; is as above and £9 is a Gaussian r.v. with 
zero mean and variance o%. Let £; and £4 be statistically independent. 
Then, from Saltzberg’s bound (92), the moment-generating function of 


g, 
2 
M;(s) = Mz,(s) « Me,(s) < exp Es (0? + |, $20. 
It follows from the Chernoff bounding technique that, for r > 0 


r2 
Prié > r} < a ae ae 93 
he Saree | 2(03 + A| ee) 

Let us now apply (93) to establish the claims made in remarks iv and 
v in Section V. In remark iv, observe that x(t) is a random variable of 
the form of £1, i.e., a linear combination of the data symbols {a;}. If we 
apply (93) with £ = x(t), & = 0, we obtain the inequality of remark 
Lv. 

Next consider Remark v. Observe that, due to the linearity of the 
system, the error &; — a; is of the form of &, with of + 03 = ¢?. Thus (93) 
yields 

—1 
Pr{(a@; — aj) > 1} S exp (sal: 
Cj 
Since &; # aj, only when |&; — a;| > 1, we have 


= 
P.j = Priaj; # aj} S 2 exp {af 
26; 


APPENDIX C 

Theorem 7: Let {;(-)}Xy be an orthonormal set of sequences (in lg) with 
support on [1, N]. That is, Dy¢; = $; and (¢;, dy) = Sjn,1 S j,k SN. 
Let ®;r(f), -~ <f < ~, be the Fourier transform of $;(+),1 Sj SN. 
Then 


N 
2% |erlhl?= N, —o<f<o, 
‘= 


Proof: Let v(n) = e—#27/T” or 0 according as n e€ [1, N] orn ¢ [1, N]. 
From the orthonormality of the {¢;}, we conclude that they span the 
N-dimensional space of complex-valued sequences with support on 
[1, N]. Thus we can write 


N 
v(n) = Xu v;o;(n), -o<n<», 
= 
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where 


N 
vj = (v, oj) = u u(n)o;(n) 


=, e-i2fTrg. (n)= Bjr(f), 1<5j5N. 


n=-—~-0 


Furthermore, the orthonormality of the {¢;} also implies 
N N N 
YH jo(r)PP= dX fojl?= dV | Br? 
n=1 j=l j=. 
Since |uv(n)| = 1, we have established the theorem. 
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Sound Alerter Powered Over an Optical Fiber 


By B. C. DeLOACH, ur., R. C. MILLER, and S. KAUFMAN 
(Manuscript received July 18, 1978) 


An optically powered sound alerter has been constructed which 
demonstrates the feasibility of converting optical power into sound 
power with good efficiency and at power levels comparable to those of 
present telephone ringers. The alerter has an overall optical-to-acoustic 
efficiency of about 35 percent at 2 mW of acoustic output power. Optical 
power is converted to electrical power by a 52-percent efficient pho- 
tovoltaic detector and then into acoustical power by a 72-percent ef- 
ficlent electroacoustic tone generator which uses a piezoelectric 
transducer. This demonstration establishes that it is technically fea- 
stble to deliver optically, via a fiber lightguide, sufficient power to op- 
erate a telephone, since all other telephone signaling functions can be 
accomplished, in principle, with less power and within the context of 
dielectric lightguide technology. For conventional usage, the design 
of a telephone alerter must take many factors into consideration, in- 
cluding background noise masking, frequencies not irritating to the 
customer, satisfactory performance for customers with impaired 
hearing, etc. These factors have not been addressed here. 


l. INTRODUCTION 


The potential introduction of lightguide connecting residential and 
commercial premises to central switching offices offers exciting possi- 
bilities for communications users. The availability to each customer of 
hundreds of megahertz of inexpensive switchable bandwidth could re- 
volutionize telecommunications. One cost barrier to the employment 
of lightguide in the local loop would be eased if the guide could also be 
used to provide the essential functions of ordinary telephone service 
without requiring metallic wires to carry electrical power to the tele- 
phone. The possibility would then exist for introducing a lightguide 
telephone system; the essential functions of this system would be pow- 
ered from central offices, and broadband services could subsequently 
be added to it in a cost-effective manner. The broadband services and 
non-essential telephone services could be locally powered. 
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The largest technical uncertainty limiting the consideration of di- 
electric lightguide for ordinary telephony is power: Can telephone op- 
erating power requirements realistically be met by photovoltaic con- 
version of optical power emergent from the lightguide? Since the largest 
power demands in a conventional telephone occur when the bell is rung, 
we have given first priority to investigating the power efficiency of an 
optically driven sound alerter. The other essential functions—speech 
signaling and recognition of the telephone hook status—will be discussed 
in a subsequent report. 


ll, AN OPTICALLY POWERED ALERTER 

Electromechanical ringers of the TRIMLINE® and 500D-type tele- 
phones produce multitone outputs in the range from 0.4 to 0.6 mW of 
acoustic power; the simultaneous sounding of five ringers (extension 
telephones) produces 2.5 mW of acoustic power; and the S1A “hard- 
of-hearing” alerter produces 4 mW of acoustic power. Cost, physical size, 
and customer acceptability of the alerter noise are important features 
in the design of these ringers and in the past have dominated the eco- 
nomic importance of high electroacoustic efficiency. However, if the 
acoustic power levels just listed are to be realized (or even approached) 
with optically powered sound alerters, then the attainment of high op- 
tical-to-acoustic power conversion efficiency is a matter of paramount 
necessity—at least within the boundaries set by present laser and 
lightguide technology. 

We have fabricated an optical sound alerter demonstration unit, 
shown in the photograph of Fig. 1 and block diagram of Fig. 2, in which 
optical power from a GaAlAs laser is transmitted through a large nu- 
merical aperture, low-loss, optical fiber and is air-coupled into a GaAlAs 
photovoltaic detector! where it generates dc electrical power at 1.0 volt. 
This is converted by attendant circuitry into an audio frequency wave- 
form at the terminals of an electroacoustic tone generator which uses 
a piezoelectric transducer. The components are mounted on a 16-in. X 
10-in. X 44-in. Lucite board with compact X-Y-Z positioners used for 
optical alignments. The 0.823-in. high X 1.156-in. diameter Helmholtz 
acoustic cavity was constructed as an integral part of the board, with the 
tone generator acoustic output coupled into the surroundings via a 
Helmholtz air piston consisting of 97 holes of 0.078-in. diameter drilled 
through the 4/,-in. Lucite thickness. A surface-tension microlens, formed 
by wetting the cut end of the optical fiber with a small drop of optical 
cement and curing in ultraviolet light, increased the laser-to-fiber cou- 
pling efficiency from 45 to over 60 percent. The photovoltaic detector 
quantum efficiency was improved by applying a thin-film, ZrOs, anti- 
reflection coating which reduced the net reflection loss to 2.5 percent. 

To simulate conventional telephone ringing, the laser transmitter is 
operated in a 2-second-ON, 3-second-OFF cycle (but is otherwise un- 
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Fig. 1—Optically power sound alerter demonstration unit. Laser, on the right, is coupled 
by large N.A. fiber lightguide to the photovoltaic detector. Tone generator and high-Q 
inductors for the ringing choke circuit are at lower left. 


modulated), causing the alerter to respond similarly. These periodic 
bursts of optical power are converted into electrical power by the pho- 
todetector, and this power is converted into an audio signal by a free- 
running, 2.0-kHz nominal frequency, astable multivibrator and an 
acoustically damped, ringing-choke circuit whose capacitive element 
is the tone generator. A portion of the acoustic response curve of the tone 
generator at room temperature is shown in the inset of Fig. 2. A raucous, 
buzzer-like sound is produced by a second multivibrator which sweeps 
the audio frequency over a +100 Hz interval about 1.98 kHz at a 30-Hz 
rate. The ringing-choke and frequency-control circuits are operated 
directly from the photovoltaic detector output terminals. 

Selection of the 2-kHz tone generator was based on measured com- 
parisons of its acoustic output power with that of similarly designed 
1.0-kHz and 1.4-kHz tone generators, the results being qualitatively 
consistent with acoustic efficiency scaling laws. The +100-Hz frequency 
modulation ameliorates some of the more irritating effects associated 
with the use of a single, unmodulated tone—e.g., strong standing waves 
(and hence dead zones) in a room, an intensely piercing sound, and 
masking of the alerter sound if the noise environment contains pure tones 
of similar frequency. The expedient adopted for this demonstration is 
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Fig. 2—Schematic of sound alert demonstrator. Laser is battery-operated, with a timing 
circuit controlling the 2-second-ON, 3-second-OFF ringing cycle. Optical acoustic efficiency 
was measured with the +100-Hz sweeper disconnected and the optical fiber replaced by 
lenses. A 45F transistor is used for 7. The values of L1, Le, and C, are 0.24h, 0.16h, and 
0.0033 uf for 1.98-kHz operation. The tone generator room temperature acoustic output 
power at 5.0 V rms is shown in the inset for a limited frequency range. 


a departure from usage in existing multitone telephone ringers, which 
incorporate widely spaced frequencies in the range 750 to 1600 Hz to 
satisfy human factors criteria. Our optical techniques can be extended 
to multitone systems with some loss in electroacoustic efficiency. It 
should also be noted that the volume occupied by the present tone 
generator and the high-Q inductors of the ringing choke circuit is larger 
than that consumed by conventional electromechanical telephone 
ringers. 


lll. EFFICIENCY 


As seen from the Fig. 2 inset, the tone-generator output power varies 
by about +5 percent over the +100-Hz swept band. To obtain a definite, 
single-frequency result, the sound alerter optical-to-acoustical power 
conversion efficiency was measured at 1.98 kHz with the +100-Hz 
sweeper disconnected. This sweeper consumes approximately 3 percent 
of the photovoltaic output power when used in the sound alerter dem- 
onstration unit. Accuracy in the efficiency measurements required that 
the optical fiber be replaced by a lens system which focused the \ = 0.801 
pm laser output, with correction for astigmatism,2* onto the photovoltaic 
detector. (Auxiliary measurements show that the fiber-to-detector 
coupling efficiency exceeds 95 percent.) Optical powers were measured 
with a thermopile whose calibration against a pyroelectric radiometer 
of +0.5 percent absolute accuracy agreed to within +1.1 percent with 
its standard lamp calibration. Acoustic powers were measured as func- 
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tions of frequency and rms voltage in an absolutely calibrated anechoic 
test chamber. Electroacoustic efficiencies were obtained from this cal- 
ibration and the voltage and current waveforms and phase angle at the 
tone generator. 

A summary of the optical, electrical, and acoustic powers and power 
conversion efficiencies obtained in this measurement is given in Fig. 3. 
The overall efficiency, defined as the total sound power divided by the 
optical power incident onto the photovoltaic detector, achieves a max- 
imum value in the range 33 to 36 percent at acoustic powers in the 
neighborhood of 2.1 mW. With our present unsophisticated circuits, the 
alerter efficiency decreases slowly and uniformly as the acoustic (or 
optical) power is lowered, and it decreases abruptly if the power is raised 
too high. This behavior follows from the shape of the photovoltaic V-I 
curve and from the fact that the ringing choke circuit presents a load at 
the detector output terminals which is nearly independent of optical 
power and which permits optimum photovoltaic efficiency to be achieved 
only over a narrow optical power range. Thus, the total acoustic power 
of 2.1 mW might be distributed among three or four alerters ringing si- 
multaneously, but only if the present circuit were modified to maintain 
an optimum efficiency impedance match to the photodetector. Reali- 
zation, in the present arrangement, of a previously attained “best” value 
of photovoltaic efficiency! could raise the overall efficiency into the 37 
to 40 percent range. 
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Fig. 3—Power and efficiency. (a) The indicated values represent optical power incident 
onto detector, detector dc output power, circuit audio frequency power into tone generator, 
and tone generator total radiated acoustic power at 1980 Hz. (b) The photovoltaic efficiency 
was 52 percent, and the tone generator efficiency was 72 percent for the results cited in 
this paper. A value of 58 percent has been measured on another detector of this type, and 
the present tone generator can provide 75 percent electroacoustic efficiency (over a nar- 
rower frequency band) by increasing its cavity height to 0.861 inch. 
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IV. EXTRAPOLATION TO FUTURE TECHNOLOGY A POWER ESTIMATE 


The losses expected for the large N.A. fiber at the 0.801-um wavelength 
used in the present demonstration alerter exceed 6 dB/km. Thus, even 
with perfect laser-to-fiber and fiber-to-detector coupling, at least 5 watts 
of optical power would be needed at the input end of a 5-km fiber to 
produce 5 mW of light at the photodetector, a large power indeed. 
However, fiber losses as low as 1.2 dB/km have been reported? at wave- 
lengths near 0.9 um, which are attainable with GaAs lasers and photo- 
detectors, and as low as 0.5 dB/km at wavelengths near 1.3 um, which 
are attainable with In, G,_,As,Pj_, devices. 

For purposes of estimating the power that might eventually be needed 
at the central office to operate one optically driven sound alerter, we will 
consider the more optimistic case corresponding to 1.3-um wavelength. 
In expectation that multi-junction® InGaAsP photovoltaic detectors can 
be constructed whose power conversion efficiency will be comparable 
to that of the present GaAs detector, we arrive at the values listed in 
Table I. 

A 10-percent efficient InGaAsP laser driving a 5-km loop needs to 
draw only 0.10 watt during the alerter sounding (and considerably less 
during speech communication). For comparison, we note that present 
Bell-System-wired telephones draw at least 20 mA from a 50V battery, 
i1.e., 1 watt, when operating, and they require more to ring. 

The alerters on conventional extension telephones are rung simulta- 
neously, but the correct engineering approach to ringing several optically 
powered telephones in one location is not obvious. A 1-second-ONn, 
2-second-OFF ringing sequence with appropriate circuits to switch be- 
tween extensions would permit the ringing of three telephones with the 
same power as noted in Table I, provided the 2-second delay between 


Table 1 — Central office power for \ = 1.3um laser to operate one sound 


alerter 
Assumed acoustic power per alerter = 0.6 mW 
‘Tone generator electroacoustic efficiency = 75 percent* 
DC-to-audio circuit efficiency = 90 percentt 
Photovoltaic power conversion efficiency = 58 percent} 
Fiber-to-photodetector coupling efficiency = 93 percent 
Optical power required at fiber output end = 165m 
Fiber loss (N.A. = 0.2) = 0.5 dB/km 
Cabling losses (including splices) = 0.8 dB/km 
Total medium loss = 1.3dB/km 
Laser-to-fiber coupling efficiency = 80 percent 
Optical power required at central office for fiber length of: 1 km 2.8 mW 
2 km 3.8 mW 
5 km 9.2 mW 
10 km 41.2 mW 


* Present best value, near 2 kHz. 
+ Assumes that circuit load line is matched to photovoltaic optimum power point for 
the optical power actually used. 
t Assumes that multi-junction photovoltaic efficiency can be made equal to single 
junction efficiency. 
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ringing the first and third telephones is tolerable. The ringing of more 
extensions and of a hard-of-hearing alerter would probably require the 
assistance of local power, as would additional services such as wideband 
video. 

One reason for examining the possibilities of optical systems in the 
loop plant, in today’s time frame, stems from the expanded services to 
telephone subscribers which are implicit in the hundreds of megahertz 
of switchable fiber bandwidth. Beyond this, there are many technical 
advantages, including freedom from lightning strikes (low voltage 
electronics), power line pick-up, and problems with potentially dan- 
gerous electrical pick-up from customer-provided electrical equipment. 
Optical loop systems would be more difficult to tap and thus somewhat 
more secure than wired systems. They may also exhibit compelling cost 
advantages and power savings, but today’s technology is much too ru- 
dimentary to permit firm predictions. 


V. SUMMARY 


We have measured an optical-to-acoustic power conversion efficiency 
of 33 to 36 percent on an optically powered sound alerter driven by a 
0.801-um wavelength laser. Assuming that comparable efficiency can 
be attained in the 1.3-um wavelength range of low fiber-lightguide losses, 
this result suggests that the optical powering of telephones over glass 
fibers may in the future be technically feasible for loop lengths up to at 
least 5 km—in the sense that the other essential station set signaling 
functions can be performed compatibly with an all-dielectric technology 
at less power than that consumed by the sound alerter. 

Since the high efficiency is made possible by restricting the alerter 
acoustic output to a narrow range of high frequencies, the tone quality 
of optically driven alerters would not be expected to equal that of con- 
ventional electromechanical ringers. Also, it is unclear, with limited 
optical powers, how best to handle the problems of ringing extension 
telephones and hard-of-hearing alerters. 
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